Abstract

We review theory and applications of weak gravitational lensing. After summarising 
Friedmann-Lemaitre cosmological models, we present the formalism of gravitational lens- 
ing and light propagation in arbitrary space-times. We discuss how weak-lensing effects 
can be measured. The formalism is then applied to reconstructions of galaxy-cluster mass 
distributions, gravitational lensing by large-scale matter distributions, QSO-galaxy corre- 
lations induced by weak lensing, lensing of galaxies by galaxies, and weak lensing of the 
cosmic microwave background.

1 Introduction



1.1 Gravitational Light Deflection

Light rays are deflected when they propagate through an inhomogeneous gravita- 
tional field. Although several researchers had speculated about such an effect well 
before the advent of General Relativity (see Schneider et al. 1992 for a historical 
account), it was Einstein's theory which elevated the deflection of light by masses 
from a hypothesis to a firm prediction. Assuming light behaves like a stream of 
particles, its deflection can be calculated within Newton's theory of gravitation, but 
General Relativity predicts that the effect is twice as large. A light ray grazing the 
surface of the Sun is deflected by 1.75 arc seconds compared to the 0.87 arc sec- 
onds predicted by Newton's theory. The confirmation of the larger value in 1919 
was perhaps the most important step towards accepting General Relativity as the 
correct theory of gravity (Eddington 1920). 
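
The two numbers quoted above can be checked with a few lines of code. The sketch below assumes the standard point-mass results, which are not spelled out in the text: a relativistic deflection angle of 4GM/(c^2 b) for impact parameter b, and half that value in the Newtonian corpuscular picture. The constants are rounded reference values and the function name is illustrative only.

```python
import math

# Rounded reference constants in SI units (assumed values, not from the text).
G = 6.674e-11        # gravitational constant, m^3 kg^-1 s^-2
C = 2.998e8          # speed of light, m s^-1
M_SUN = 1.989e30     # solar mass, kg
R_SUN = 6.957e8      # solar radius, m
RAD_TO_ARCSEC = 180.0 / math.pi * 3600.0


def deflection_arcsec(mass, impact_parameter, relativistic=True):
    """Deflection of a light ray passing a point mass, in arc seconds."""
    # Newtonian corpuscular value: 2 G M / (c^2 b)
    newtonian = 2.0 * G * mass / (C ** 2 * impact_parameter)
    # General Relativity doubles the Newtonian value.
    angle = 2.0 * newtonian if relativistic else newtonian
    return angle * RAD_TO_ARCSEC


gr_value = deflection_arcsec(M_SUN, R_SUN)
newton_value = deflection_arcsec(M_SUN, R_SUN, relativistic=False)
print(f"General Relativity: {gr_value:.3f} arcsec")
print(f"Newtonian:          {newton_value:.3f} arcsec")
```

With these inputs the relativistic value comes out close to 1.75 arc seconds, and the Newtonian value is exactly half of it.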

Cosmic bodies more distant, more massive, or more compact than the Sun can bend 
light rays from a single source sufficiently strongly so that multiple light rays can 
reach the observer. The observer sees an image in the direction of each ray arriv- 
ing at their position, so that the source appears multiply imaged. In the language 
of General Relativity, there may exist more than one null geodesic connecting the 
world-line of a source with the observation event. Although predicted long before, 
the first multiple-image system was discovered only in 1979 (Walsh et al. 1979). 
From then on, the field of gravitational lensing developed into one of the most ac- 
tive subjects of astrophysical research. Several dozens of multiply-imaged sources 
have since been found. Their quantitative analysis provides accurate masses of, 
and in some cases detailed information on, the deflectors. An example is shown in 
Fig. 1. 

Tidal gravitational fields lead to differential deflection of light bundles. The size 
and shape of their cross sections are therefore changed. Since photons are neither 
emitted nor absorbed in the process of gravitational light deflection, the surface 
brightness of lensed sources remains unchanged. Changing the size of the cross 
section of a light bundle therefore changes the flux observed from a source. The 
different images in multiple-image systems generally have different fluxes. The 
images of extended sources, i.e. sources which can observationally be resolved, are 
deformed by the gravitational tidal field. Since astronomical sources like galaxies 
are not intrinsically circular, this deformation is generally very difficult to identify 
in individual images. In some cases, however, the distortion is strong enough to be 
readily recognised, most noticeably in the case of Einstein rings (see Fig. 2) and 
arcs in galaxy clusters (Fig. 3). 

Fig. 1. The gravitational lens system 2237+0305 consists of a nearby spiral galaxy at
redshift $z_\mathrm{d} = 0.039$ and four images of a background quasar with redshift
$z_\mathrm{s} = 1.69$. It was discovered by Huchra et al. (1985). The image was taken by
the Hubble Space Telescope and shows only the innermost region of the lensing galaxy.
The central compact source is the bright galaxy core, the other four compact sources are
the quasar images. They differ in brightness because they are magnified by different
amounts. The four images roughly fall on a circle concentric with the core of the lensing
galaxy. The mass inside this circle can be determined with very high accuracy
(Rix et al. 1992). The largest separation between the images is 1.8".

If the light bundles from some sources are distorted so strongly that their images
appear as giant luminous arcs, one may expect many more sources behind a cluster
whose images are only weakly distorted. Although weak distortions in individual 
images can hardly be recognised, the net distortion averaged over an ensemble of 
images can still be detected. As we shall describe in Sect. 2.3, deep optical expo- 
sures reveal a dense population of faint galaxies on the sky. Most of these galaxies 
are at high redshift, thus distant, and their image shapes can be utilised to probe the 
tidal gravitational field of intervening mass concentrations. Indeed, the tidal gravi- 
tational field can be reconstructed from the coherent distortion apparent in images 
of the faint galaxy population, and from that the density profile of intervening clus- 
ters of galaxies can be inferred (see Sect. 4). 

1.2 Weak Gravitational Lensing

This review deals with weak gravitational lensing. There is no generally applicable
definition of weak lensing despite the fact that it constitutes a flourishing area
of research. The common aspect of all studies of weak gravitational lensing is that
measurements of its effects are statistical in nature. While a single multiply-imaged 
source provides information on the mass distribution of the deflector, weak lensing 
effects show up only across ensembles of sources. One example was given above:
The shape distribution of an ensemble of galaxy images is changed close to a massive
galaxy cluster in the foreground, because the cluster's tidal field polarises the
images. We shall see later that the size distribution of the background galaxy population
is also locally changed in the neighbourhood of a massive intervening mass
concentration.

Fig. 2. The radio source MG 1131+0456 was discovered by Hewitt et al. (1988) as the
first example of a so-called Einstein ring. If a source and an axially symmetric lens are
co-aligned with the observer, the symmetry of the system permits the formation of a
ring-like image of the source centred on the lens. If the symmetry is broken (as expected for
all realistic lensing matter distributions), the ring is deformed or broken up, typically into
four images (see Fig. 1). However, if the source is sufficiently extended, ring-like images
can be formed even if the symmetry is imperfect. The 6 cm radio map of MG 1131+0456
shows a closed ring, while the ring breaks up at higher frequencies where the source is
smaller. The ring diameter is 2.1".

Magnification and distortion effects due to weak lensing can be used to probe the 
statistical properties of the matter distribution between us and an ensemble of dis- 
tant sources, provided some assumptions on the source properties can be made. 
For example, if a standard candle¹ at high redshift is identified, its flux can be
used to estimate the magnification along its line-of-sight. It can be assumed that
the orientation of faint distant galaxies is random. Then, any coherent alignment of
images signals the presence of an intervening tidal gravitational field. As a third
example, the positions on the sky of cosmic objects at vastly different distances from
us should be mutually independent. A statistical association of foreground objects
with background sources can therefore indicate the magnification caused by the
foreground objects on the background sources.

¹ The term standard candle is used for any class of astronomical objects whose intrinsic
luminosity can be inferred independently of the observed flux. In the simplest case, all
members of the class have the same luminosity. More typically, the luminosity depends
on some other known and observable parameters, such that the luminosity can be inferred
from them. The luminosity distance to any standard candle can directly be inferred from the
square root of the ratio of source luminosity and observed flux. Since the luminosity
distance depends on cosmological parameters, the geometry of the Universe can then directly
be investigated. Probably the best current candidates for standard candles are supernovae
of Type Ia. They can be observed to quite high redshifts, and thus be utilised to estimate
cosmological parameters (e.g. Riess et al. 1998).

Fig. 3. The cluster Abell 2218 hosts one of the most impressive collections of arcs. This
HST image of the cluster's central region shows a pattern of strongly distorted galaxy
images tangentially aligned with respect to the cluster centre, which lies close to the bright
galaxy in the upper part of this image. The frame measures about 80" x 160".

All these effects are quite subtle, or weak, and many of the current challenges in
the field are observational in nature. A coherent alignment of images of distant
galaxies can be due to an intervening tidal gravitational field, but could also be due
to propagation effects in the Earth's atmosphere or in the telescope. A variation
in the number density of background sources around a foreground object can be
due to a magnification effect, but could also be due to non-uniform photometry or
obscuration effects. These potential systematic effects have to be controlled at a
level well below the expected weak-lensing effects. We shall return to this essential
point at various places in this review.

1.3 Applications of Gravitational Lensing

Gravitational lensing has developed into a versatile tool for observational cosmology.
There are two main reasons:

(1) The deflection angle of a light ray is determined by the gravitational field of
the matter distribution along its path. According to Einstein's theory of Gen- 
eral Relativity, the gravitational field is in turn determined by the stress-energy 
tensor of the matter distribution. For the astrophysically most relevant case of 
non-relativistic matter, the latter is characterised by the density distribution 
alone. Hence, the gravitational field, and thus the deflection angle, depend 
neither on the nature of the matter nor on its physical state. Light deflection 
probes the total matter density, without distinguishing between ordinary (bary- 
onic) matter or dark matter. In contrast to other dynamical methods for probing 
gravitational fields, no assumption needs to be made on the dynamical state of 
the matter. For example, the interpretation of radial velocity measurements in 
terms of the gravitating mass requires the applicability of the virial theorem 
(i.e., the physical system is assumed to be in virial equilibrium), or knowledge 
of the orbits (such as the circular orbits in disk galaxies). However, as will be 
discussed in Sect. 3, lensing measures only the mass distribution projected 
along the line-of-sight, and is therefore insensitive to the extent of the mass
distribution along the light rays, as long as this extent is small compared to 
the distances from the observer and the source to the deflecting mass. Keeping 
this in mind, mass determinations by lensing do not depend on any symmetry 
assumptions. 

(2) Once the deflection angle as a function of impact parameter is given, gravi- 
tational lensing reduces to simple geometry. Since most lens systems involve 
sources (and lenses) at moderate or high redshift, lensing can probe the ge- 
ometry of the Universe. This was noted by Refsdal (1964), who pointed out 
that lensing can be used to determine the Hubble constant and the cosmic 
density parameter. Although this turned out later to be more difficult than 
anticipated at the time, first measurements of the Hubble constant through 
lensing have been obtained with detailed models of the matter distribution 
in multiple-image lens systems and the difference in light-travel time along 
the different light paths corresponding to different images of the source (e.g., 
Kundić et al. 1997; Schechter et al. 1997; Biggs et al. 1998). Since the volume
element per unit redshift interval and unit solid angle also depends on
the geometry of space-time, so does the number of lenses therein. Hence, the
lensing probability for distant sources depends on the cosmological parameters
(e.g., Press & Gunn 1973). Unfortunately, in order to derive constraints
on the cosmological model with this method, one needs to know the evolu- 
tion of the lens population with redshift. Nevertheless, in some cases, sig- 
nificant constraints on the cosmological parameters (Kochanek 1993, 1996; 
Maoz & Rix 1993; Bartelmann et al. 1998; Falco et al. 1998), and on the evo- 
lution of the lens population (Mao & Kochanek 1994) have been derived from 
multiple-image and arc statistics. 



The possibility to directly investigate the dark-matter distribution led to sub- 
stantial results over recent years. Constraints on the size of the dark-matter 
haloes of spiral galaxies were derived (e.g., Brainerd et al. 1996), the presence
of dark-matter haloes in elliptical galaxies was demonstrated (e.g.,
Maoz & Rix 1993; Griffiths et al. 1996), and the projected total mass distribution in
many clusters of galaxies was mapped (e.g., Kneib et al. 1996; Hoekstra et al. 1998;
Kaiser et al. 1998). These results directly impact on our understanding of structure 
formation, supporting hierarchical structure formation in cold dark matter (CDM) 
models. Constraints on the nature of dark matter were also obtained. Compact 
dark-matter objects, such as black holes or brown dwarfs, cannot be very abundant
in the Universe, because otherwise they would lead to observable lensing effects
(e.g., Schneider 1993; Dalcanton et al. 1994). Galactic microlensing experiments
constrained the density and typical mass scale of massive compact halo objects
in our Galaxy (see Paczyński 1996, Roulet & Mollerach 1997 and Mao 2000
for reviews). We refer the reader to the reviews by Blandford & Narayan (1992), 
Schneider (1996a) and Narayan & Bartelmann (1997) for a detailed account of the 
cosmological applications of gravitational lensing. 

We shall concentrate almost entirely on weak gravitational lensing here. Hence, 
the flourishing fields of multiple-image systems and their interpretation, Galactic
microlensing and its consequences for understanding the nature of dark matter in 
the halo of our Galaxy, and the detailed investigations of the mass distribution 
in the inner parts of galaxy clusters through arcs, arclets, and multiply imaged 
background galaxies, will not be covered in this review. In addition to the refer- 
ences given above, we would like to point the reader to Refsdal & Surdej (1994), 
Fort & Mellier (1994), and Wu (1996) for more recent reviews on various aspects 
of gravitational lensing, to Mellier (1998) for a very recent review on weak lensing, 
and to the monograph (Schneider et al. 1992) for a detailed account of the theory 
and applications of gravitational lensing. 



1.4 Structure of this Review

Many aspects of weak gravitational lensing are intimately related to the cosmo- 
logical model and to the theory of structure formation in the Universe. We there- 
fore start the review by giving some cosmological background in Sect. 2. After 
summarising Friedmann-Lemaitre-Robertson-Walker models, we sketch the the- 
ory of structure formation, introduce astrophysical objects like QSOs, galaxies, 
and galaxy clusters, and finish the Section with a general discussion of correla- 
tion functions, power spectra, and their projections. Gravitational light deflection 
in general is the subject of Sect. 3, and the specialisation to weak lensing is de- 
scribed in Sect. 4. One of the main aspects there is how weak lensing effects can be 
quantified and measured. The following two sections describe the theory of weak 
lensing by galaxy clusters (Sect. 5) and cosmological mass distributions (Sect. 6). 
Apparent correlations between background QSOs and foreground galaxies due to 
the magnification bias caused by large-scale matter distributions are the subject of 
Sect. 7. Weak lensing effects of foreground galaxies on background galaxies are
reviewed in Sect. 8, and Sect. 9 finally deals with weak lensing of the most distant
and most extended source possible, i.e. the Cosmic Microwave Background. We 
present a brief summary and an outlook in Sect. 10. 

We use standard astronomical units throughout: $1\,M_\odot$ = 1 solar mass = $2 \times 10^{33}$ g;
1 Mpc = 1 megaparsec = $3.1 \times 10^{24}$ cm.

2 Cosmological Background

We review in this section those aspects of the standard cosmological model which 
are relevant for our further discussion of weak gravitational lensing. This standard
model consists of a description for the cosmological background which is a homogeneous
and isotropic solution of the field equations of General Relativity, and a
theory for the formation of structure. 

The background model is described by the Robertson-Walker metric
(Robertson 1935; Walker 1935), in which hypersurfaces of constant time are 
homogeneous and isotropic three-spaces, either flat or curved, and change with 
time according to a scale factor which depends on time only. The dynamics of the 
scale factor is determined by two equations which follow from Einstein's field 
equations given the highly symmetric form of the metric. 

Current theories of structure formation assume that structure grows via gravita- 
tional instability from initial seed perturbations whose origin is yet unclear. Most 
common hypotheses lead to the prediction that the statistics of the seed fluctua- 
tions is Gaussian. Their amplitude is low for most of their evolution so that lin- 
ear perturbation theory is sufficient to describe their growth until late stages. For 
general references on the cosmological model and on the theory of structure formation,
cf. Weinberg (1972), Misner et al. (1973), Peebles (1980), Börner (1988),
Padmanabhan (1993), Peebles (1993), and Peacock (1999). 

2.1 Friedmann-Lemaitre Cosmological Models
2.1.1 Metric 

Two postulates are fundamental to the standard cosmological model, which are: 

(1) When averaged over sufficiently large scales, there exists a mean motion of 
radiation and matter in the Universe with respect to which all averaged ob- 
servable properties are isotropic. 

(2) All fundamental observers, i.e. imagined observers which follow this mean 
motion, experience the same history of the Universe, i.e. the same averaged 
observable properties, provided they set their clocks suitably. Such a universe 
is called observer-homogeneous. 

General Relativity describes space-time as a four-dimensional manifold whose metric
tensor $g_{\alpha\beta}$ is considered as a dynamical field. The dynamics of the metric
is governed by Einstein's field equations, which relate the Einstein tensor to the
stress-energy tensor of the matter contained in space-time. Two events in space-time
with coordinates differing by $\mathrm{d}x^\alpha$ are separated by $\mathrm{d}s$, with
$\mathrm{d}s^2 = g_{\alpha\beta}\,\mathrm{d}x^\alpha\,\mathrm{d}x^\beta$.
The eigentime (proper time) of an observer who travels by $\mathrm{d}s$ changes by
$c^{-1}\,\mathrm{d}s$. Greek indices run over $0 \ldots 3$ and Latin indices run over the
spatial indices $1 \ldots 3$ only.

The two postulates stated above considerably constrain the admissible form of the
metric tensor. Spatial coordinates which are constant for fundamental observers are
called comoving coordinates. In these coordinates, the mean motion is described by
$\mathrm{d}x^i = 0$, and hence $\mathrm{d}s^2 = g_{00}\,\mathrm{d}t^2$. If we require that
the eigentime of fundamental observers equal the cosmic time, this implies $g_{00} = c^2$.

Isotropy requires that clocks can be synchronised such that the space-time components
of the metric tensor vanish, $g_{0i} = 0$. If this were impossible, the components
$g_{0i}$ would single out one particular direction in space-time, violating isotropy.
The metric can therefore be written

$$ \mathrm{d}s^2 = c^2\,\mathrm{d}t^2 + g_{ij}\,\mathrm{d}x^i\,\mathrm{d}x^j , \tag{2.1} $$

where $g_{ij}$ is the metric of spatial hypersurfaces. In order not to violate isotropy,
the spatial metric can only isotropically contract or expand with a scale function
$a(t)$ which must be a function of time only, because otherwise the expansion would
be different at different places, violating homogeneity. Hence the metric further
simplifies to

$$ \mathrm{d}s^2 = c^2\,\mathrm{d}t^2 - a^2(t)\,\mathrm{d}l^2 , \tag{2.2} $$

where $\mathrm{d}l$ is the line element of the homogeneous and isotropic three-space. A
special case of the metric (2.2) is the Minkowski metric, for which $\mathrm{d}l$ is the Euclidean
line element and $a(t)$ is a constant. Homogeneity also implies that all quantities
describing the matter content of the Universe, e.g. density and pressure, can be 
functions of time only. 

The spatial hypersurfaces whose geometry is described by $\mathrm{d}l^2$ can either be flat or
curved. Isotropy only requires them to be spherically symmetric, i.e. spatial surfaces
of constant distance from an arbitrary point need to be two-spheres. Homogeneity
permits us to choose an arbitrary point as coordinate origin. We can then introduce
two angles $\theta$, $\phi$ which uniquely identify positions on the unit sphere around
the origin, and a radial coordinate $w$. The most general admissible form for the
spatial line element is then

$$ \mathrm{d}l^2 = \mathrm{d}w^2 + f_K^2(w)\left(\mathrm{d}\theta^2 + \sin^2\theta\,\mathrm{d}\phi^2\right) \equiv \mathrm{d}w^2 + f_K^2(w)\,\mathrm{d}\omega^2 . \tag{2.3} $$

Homogeneity requires that the radial function $f_K(w)$ is either a trigonometric, linear,
or hyperbolic function of $w$, depending on whether the curvature $K$ is positive,
zero, or negative. Specifically,

$$ f_K(w) = \begin{cases}
K^{-1/2}\,\sin\left(K^{1/2} w\right) & (K > 0) \\
w & (K = 0) \\
(-K)^{-1/2}\,\sinh\left[(-K)^{1/2} w\right] & (K < 0)
\end{cases} . \tag{2.4} $$

Note that $f_K(w)$ and thus $|K|^{-1/2}$ have the dimension of a length. If we define the
radius $r$ of the two-spheres by $f_K(w) \equiv r$, the metric $\mathrm{d}l^2$ takes the alternative form

$$ \mathrm{d}l^2 = \frac{\mathrm{d}r^2}{1 - K r^2} + r^2\,\mathrm{d}\omega^2 . \tag{2.5} $$
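
As a small illustration of eq. (2.4), the radial function can be written as a three-branch function. The sketch below is only a direct transcription under the conventions of the text; the name `f_K` and the argument order are choices made here. It also checks numerically the identity (df_K/dw)^2 = 1 - K f_K^2, which is what makes the two forms (2.3) and (2.5) of the line element equivalent.

```python
import math


def f_K(w, K):
    """Radial function of eq. (2.4).

    w : radial coordinate (a length)
    K : spatial curvature (inverse length squared)
    """
    if K > 0:
        root = math.sqrt(K)
        return math.sin(root * w) / root
    if K < 0:
        root = math.sqrt(-K)
        return math.sinh(root * w) / root
    return w  # flat space, K = 0


# With r = f_K(w), dr^2 / (1 - K r^2) equals dw^2 provided that
# (df_K/dw)^2 = 1 - K f_K^2. Check this with a central difference.
for curvature in (1.0, 0.0, -1.0):
    w0, eps = 0.7, 1e-6
    slope = (f_K(w0 + eps, curvature) - f_K(w0 - eps, curvature)) / (2 * eps)
    residual = slope ** 2 - (1.0 - curvature * f_K(w0, curvature) ** 2)
    print(f"K = {curvature:+.0f}: residual = {residual:.1e}")
```

For small w all three branches reduce to f_K(w) ≈ w, as expected for a space that is locally flat.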



2.1.2 Redshift

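Due to the expansion of space, photons are redshifted while they propagate from
the source to the observer. Consider a comoving source emitting a light signal at
$t_\mathrm{e}$ which reaches a comoving observer at the coordinate origin $w = 0$ at time $t_\mathrm{o}$.
Since $\mathrm{d}s = 0$ for light, a backward-directed radial light ray propagates according to
$|c\,\mathrm{d}t| = a\,\mathrm{d}w$, from the metric. The (comoving) coordinate distance between source
and observer is constant by definition,

$$ w_\mathrm{eo} = \int_\mathrm{o}^\mathrm{e} \mathrm{d}w = \int_{t_\mathrm{e}}^{t_\mathrm{o}(t_\mathrm{e})} \frac{c\,\mathrm{d}t}{a} = \mathrm{constant} , \tag{2.6} $$

and thus in particular the derivative of $w_\mathrm{eo}$ with respect to $t_\mathrm{e}$ is zero. It then follows
from eq. (2.6)

$$ \frac{\mathrm{d}t_\mathrm{o}}{\mathrm{d}t_\mathrm{e}} = \frac{a(t_\mathrm{o})}{a(t_\mathrm{e})} . \tag{2.7} $$

Identifying the inverse time intervals $(\mathrm{d}t_\mathrm{e,o})^{-1}$ with the emitted and observed light
frequencies $\nu_\mathrm{e,o}$, we can write

$$ \frac{\mathrm{d}t_\mathrm{o}}{\mathrm{d}t_\mathrm{e}} = \frac{\nu_\mathrm{e}}{\nu_\mathrm{o}} = \frac{\lambda_\mathrm{o}}{\lambda_\mathrm{e}} . \tag{2.8} $$

Since the redshift $z$ is defined as the relative change in wavelength, or $1 + z = \lambda_\mathrm{o}\,\lambda_\mathrm{e}^{-1}$,
we find

$$ 1 + z = \frac{a(t_\mathrm{o})}{a(t_\mathrm{e})} . \tag{2.9} $$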

This shows that light is redshifted by the amount by which the Universe has ex- 
panded between emission and observation. 
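
The step from eq. (2.6) to eq. (2.7) is an application of the Leibniz rule for differentiating an integral with variable limits. Written out as a sketch in the notation of this section (observation time $t_\mathrm{o}$ regarded as a function of emission time $t_\mathrm{e}$):

```latex
\begin{align}
0 = \frac{\mathrm{d}w_\mathrm{eo}}{\mathrm{d}t_\mathrm{e}}
  &= \frac{\mathrm{d}}{\mathrm{d}t_\mathrm{e}}
     \int_{t_\mathrm{e}}^{t_\mathrm{o}(t_\mathrm{e})} \frac{c\,\mathrm{d}t}{a(t)} \\
  &= \frac{c}{a(t_\mathrm{o})}\,\frac{\mathrm{d}t_\mathrm{o}}{\mathrm{d}t_\mathrm{e}}
     - \frac{c}{a(t_\mathrm{e})}
  \quad\Longrightarrow\quad
  \frac{\mathrm{d}t_\mathrm{o}}{\mathrm{d}t_\mathrm{e}}
     = \frac{a(t_\mathrm{o})}{a(t_\mathrm{e})} .
\end{align}
```

With the normalisation $a(t_\mathrm{o}) = 1$ at the present epoch, adopted later in the text, a source seen at redshift $z = 1$ emitted its light when the scale factor was $a = 1/2$.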
2.1.3 Expansion



To complete the description of space-time, we need to know how the scale function
$a(t)$ depends on time, and how the curvature $K$ depends on the matter which
fills space-time. That is, we ask for the dynamics of the space-time. Einstein's field
equations relate the Einstein tensor $G_{\alpha\beta}$ to the stress-energy tensor $T_{\alpha\beta}$ of the matter,

$$ G_{\alpha\beta} = \frac{8\pi G}{c^2}\,T_{\alpha\beta} + \Lambda\,g_{\alpha\beta} . \tag{2.10} $$

The second term proportional to the metric tensor $g_{\alpha\beta}$ is a generalisation introduced
by Einstein to allow static cosmological solutions of the field equations. $\Lambda$
is called the cosmological constant. For the highly symmetric form of the metric
given by (2.2) and (2.3), Einstein's equations imply that $T_{\alpha\beta}$ has to have the form
of the stress-energy tensor of a homogeneous perfect fluid, which is characterised
by its density $\rho(t)$ and its pressure $p(t)$. Matter density and pressure can only depend
on time because of homogeneity. The field equations then simplify to the two
independent equations

$$ \left(\frac{\dot a}{a}\right)^2 = \frac{8\pi G}{3}\,\rho - \frac{K c^2}{a^2} + \frac{\Lambda}{3} \tag{2.11} $$

and

$$ \frac{\ddot a}{a} = -\frac{4}{3}\,\pi G \left(\rho + \frac{3p}{c^2}\right) + \frac{\Lambda}{3} . \tag{2.12} $$



The scale factor a{t) is determined once its value at one instant of time is fixed. We 
choose a = 1 at the present epoch to. Equation (2. 1 1) is called Friedmann 's equation 
(Friedmann 1922, 1924). The two equations (2.11) and (2.12) can be combined to 
yield the adiabatic equation 

l[a\t)p{ty]+p{t)^ = Q, (2.13) 

which has an intuitive interpretation. The first term a^p is proportional to the energy 
contained in a fixed comoving volume, and hence the equation states that the change 
in 'internal' energy equals the pressure times the change in proper volume. Hence 
eq. (2.13) is the first law of thermodynamics in the cosmological context. 
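
For readers who want to see how eqs. (2.11) and (2.12) combine into eq. (2.13), one possible route is sketched here: multiply (2.11) by $a^2$, differentiate with respect to time, and eliminate $\ddot a$ with (2.12). This is a sketch of the algebra, not necessarily the route the authors had in mind.

```latex
\begin{align}
\dot a^2 &= \frac{8\pi G}{3}\,\rho\,a^2 - K c^2 + \frac{\Lambda}{3}\,a^2
  && \text{(2.11) times } a^2 \\
2\,\dot a\,\ddot a &= \frac{8\pi G}{3}\left(\dot\rho\,a^2 + 2\rho\,a\,\dot a\right)
  + \frac{2\Lambda}{3}\,a\,\dot a
  && \text{time derivative} \\
2\,\dot a\,\ddot a &= -\frac{8\pi G}{3}\left(\rho + \frac{3p}{c^2}\right) a\,\dot a
  + \frac{2\Lambda}{3}\,a\,\dot a
  && \text{(2.12) times } 2 a\,\dot a \\
0 &= \dot\rho\,a + 3\rho\,\dot a + \frac{3p}{c^2}\,\dot a
  && \text{difference, divided by } \tfrac{8\pi G}{3}\,a \\
0 &= \frac{\mathrm{d}}{\mathrm{d}t}\left(a^3 \rho\,c^2\right)
  + p\,\frac{\mathrm{d}a^3}{\mathrm{d}t}
  && \text{times } a^2 c^2
\end{align}
```

The curvature term drops out on differentiation and the cosmological-constant terms cancel, which is why neither $K$ nor $\Lambda$ appears in eq. (2.13).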

A metric of the form given by eqs. (2.2), (2.3), and (2.4) is called the Robertson-Walker
metric. If its scale factor $a(t)$ obeys Friedmann's equation (2.11) and the
adiabatic equation (2.13), it is called the Friedmann-Lemaitre-Robertson-Walker
metric, or the Friedmann-Lemaitre metric for short. Note that eq. (2.12) can also
be derived from Newtonian gravity except for the pressure term in (2.12) and the
cosmological constant. Unlike in Newtonian theory, pressure acts as a source of
gravity in General Relativity.

2.1.4 Parameters



The relative expansion rate $\dot a\,a^{-1} \equiv H$ is called the Hubble parameter, and its value
at the present epoch $t = t_0$ is the Hubble constant, $H(t_0) \equiv H_0$. It has the dimension
of an inverse time. The value of $H_0$ is still uncertain. Current measurements roughly
fall into the range $H_0 = (50 - 80)\,\mathrm{km\,s^{-1}\,Mpc^{-1}}$ (see Freedman 1996 for a review),
and the uncertainty in $H_0$ is commonly expressed as $H_0 = 100\,h\,\mathrm{km\,s^{-1}\,Mpc^{-1}}$,
with $h = (0.5 - 0.8)$. Hence

$$ H_0 \approx 3.2 \times 10^{-18}\,h\,\mathrm{s^{-1}} \approx 1.0 \times 10^{-10}\,h\,\mathrm{yr^{-1}} . \tag{2.14} $$

The time scale for the expansion of the Universe is the inverse Hubble constant, or
$H_0^{-1} \approx 10^{10}\,h^{-1}$ years.

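The combination

$$ \frac{3 H_0^2}{8\pi G} \equiv \rho_\mathrm{cr} \approx 1.9 \times 10^{-29}\,h^2\,\mathrm{g\,cm^{-3}} \tag{2.15} $$

is the critical density of the Universe, and the density $\rho_0$ in units of $\rho_\mathrm{cr}$ is the density
parameter $\Omega_0$,

$$ \Omega_0 = \frac{\rho_0}{\rho_\mathrm{cr}} . \tag{2.16} $$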
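
The numerical values in eqs. (2.14) and (2.15) follow from unit conversions alone. The sketch below reproduces them with rounded conversion factors; the constant and function names are illustrative and not part of the original text.

```python
import math

# Rounded conversion factors and constants (assumed reference values).
KM_PER_MPC = 3.0857e19       # kilometres in one megaparsec
SECONDS_PER_YEAR = 3.156e7   # seconds in one year
G_CGS = 6.674e-8             # gravitational constant, cm^3 g^-1 s^-2


def hubble_constant_per_second(h):
    """H0 = 100 h km/s/Mpc expressed in inverse seconds."""
    return 100.0 * h / KM_PER_MPC


def critical_density_cgs(h):
    """Critical density 3 H0^2 / (8 pi G) in g cm^-3, eq. (2.15)."""
    hubble = hubble_constant_per_second(h)
    return 3.0 * hubble ** 2 / (8.0 * math.pi * G_CGS)


h = 1.0
H0 = hubble_constant_per_second(h)
print(f"H0         = {H0:.2e} s^-1 = {H0 * SECONDS_PER_YEAR:.2e} yr^-1")
print(f"1 / H0     = {1.0 / (H0 * SECONDS_PER_YEAR):.2e} yr")
print(f"rho_cr     = {critical_density_cgs(h):.2e} g cm^-3")
```

Because the critical density scales as h^2, halving h reduces it by a factor of four.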

If the matter density in the universe is critical, $\rho_0 = \rho_\mathrm{cr}$ or $\Omega_0 = 1$, and if the
cosmological constant vanishes, $\Lambda = 0$, spatial hypersurfaces are flat, $K = 0$, which
follows from (2.11) and will become explicit in eq. (2.30) below. We further define

$$ \Omega_\Lambda \equiv \frac{\Lambda}{3 H_0^2} . \tag{2.17} $$

The deceleration parameter $q_0$ is defined by

$$ q_0 = -\frac{\ddot a\,a}{\dot a^2} \tag{2.18} $$

at $t = t_0$.



2.1.5 Matter Models 

For a complete description of the expansion of the Universe, we need an equation
of state $p = p(\rho)$, relating the pressure to the energy density of the matter. Ordinary
matter, which is frequently called dust in this context, has $p \ll \rho c^2$, while $p = \rho c^2/3$
for radiation or other forms of relativistic matter. Inserting these expressions into
eq. (2.13), we find

$$ \rho(t) = a^{-n}(t)\,\rho_0 , \tag{2.19} $$

with

$$ n = \begin{cases}
3 & \text{for dust, } p = 0 \\
4 & \text{for relativistic matter, } p = \rho c^2/3
\end{cases} . \tag{2.20} $$

The energy density of relativistic matter therefore drops more rapidly with time
than that of ordinary matter.
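
Both cases of eq. (2.20) follow from eq. (2.13) in one step if the equation of state is written as $p = \epsilon\,\rho c^2$ with a constant $\epsilon$. The symbol $\epsilon$ is introduced here purely for illustration and does not appear in the original text.

```latex
\begin{align}
0 &= \frac{\mathrm{d}}{\mathrm{d}t}\left(a^3 \rho\,c^2\right)
     + \epsilon\,\rho c^2\,\frac{\mathrm{d}a^3}{\mathrm{d}t}
   = c^2 \left[ a^3\,\dot\rho + (1 + \epsilon)\,\rho\,\frac{\mathrm{d}a^3}{\mathrm{d}t} \right] \\
\Longrightarrow\quad
\frac{\mathrm{d}\rho}{\rho} &= -3\,(1 + \epsilon)\,\frac{\mathrm{d}a}{a}
\quad\Longrightarrow\quad
\rho = \rho_0\,a^{-3(1+\epsilon)} ,
\end{align}
```

so that $\epsilon = 0$ (dust) gives $n = 3$ and $\epsilon = 1/3$ (relativistic matter) gives $n = 4$, using $a = 1$ at the present epoch.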



2.1.6 Relativistic Matter Components 

There are two obvious candidates for relativistic matter today, photons and neutrinos.
The energy density contained in photons today is determined by the temperature
of the Cosmic Microwave Background, $T_\mathrm{CMB} = 2.73$ K (Fixsen et al. 1996).
Since the CMB has an excellent black-body spectrum, its energy density is given 
by the Stefan-Boltzmann law. 

In terms of the cosmic density parameter $\Omega_0$ [eq. (2.16)], the cosmic density
contributed by the photon background is

$$ \Omega_\mathrm{CMB,0} = 2.4 \times 10^{-5}\,h^{-2} . \tag{2.22} $$
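
The order of magnitude of eq. (2.22) can be checked directly. The sketch below assumes the standard black-body energy density u = a_rad T^4, with the radiation constant a_rad = 4σ/c, divides by c^2 to obtain a mass density, and compares with the critical density for h = 1. Constants are rounded reference values and all names are illustrative.

```python
import math

# Rounded constants in cgs units (assumed reference values).
A_RAD = 7.5657e-15       # radiation constant 4 sigma / c, erg cm^-3 K^-4
C_CGS = 2.998e10         # speed of light, cm s^-1
G_CGS = 6.674e-8         # gravitational constant, cm^3 g^-1 s^-2
KM_PER_MPC = 3.0857e19   # kilometres in one megaparsec


def cmb_mass_density(temperature):
    """Black-body energy density divided by c^2, in g cm^-3."""
    return A_RAD * temperature ** 4 / C_CGS ** 2


def omega_cmb_h2(temperature):
    """Photon density parameter multiplied by h^2."""
    hubble_h1 = 100.0 / KM_PER_MPC                       # H0 for h = 1, s^-1
    rho_cr_h1 = 3.0 * hubble_h1 ** 2 / (8.0 * math.pi * G_CGS)
    return cmb_mass_density(temperature) / rho_cr_h1


T_CMB = 2.73  # K, value quoted in the text
print(f"rho_CMB       = {cmb_mass_density(T_CMB):.2e} g cm^-3")
print(f"Omega_CMB h^2 = {omega_cmb_h2(T_CMB):.2e}")
```

With these rounded inputs the result agrees with the quoted 2.4 × 10^-5 to within a few per cent; the small difference presumably reflects rounding in the constants used.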



Like photons, neutrinos were produced in thermal equilibrium in the hot early phase
of the Universe. Interacting weakly, they decoupled from the cosmic plasma when
the temperature of the Universe was $kT \approx 1$ MeV because later the time-scale of
their leptonic interactions became larger than the expansion time-scale of the Universe,
so that equilibrium could no longer be maintained. When the temperature
of the Universe dropped to $kT \approx 0.5$ MeV, electron-positron pairs annihilated to
produce $\gamma$ rays. The annihilation heated up the photons but not the neutrinos which
had decoupled earlier. Hence the neutrino temperature is lower than the photon
temperature by an amount determined by entropy conservation. The entropy $S_\mathrm{e}$ of
the electron-positron pairs was dumped completely into the entropy of the photon
background $S_\gamma$. Hence,

$$ (S_\mathrm{e} + S_\gamma)_\mathrm{before} = (S_\gamma)_\mathrm{after} , \tag{2.23} $$

where "before" and "after" refer to the annihilation time. Ignoring constant factors,
the entropy per particle species is $S \propto g\,T^3$, where $g$ is the statistical weight of
the species. For bosons $g = 1$, and for fermions $g = 7/8$ per spin state. Before
annihilation, we thus have $g_\mathrm{before} = 4 \cdot 7/8 + 2 = 11/2$, while after the annihilation
$g = 2$ because only photons remain. From eq. (2.23),

$$ \left(\frac{T_\mathrm{after}}{T_\mathrm{before}}\right)^3 = \frac{11}{4} . \tag{2.24} $$

After the annihilation, the neutrino temperature is therefore lower than the photon
temperature by the factor $(11/4)^{1/3}$. In particular, the neutrino temperature today
is

$$ T_{\nu,0} = \left(\frac{4}{11}\right)^{1/3} T_\mathrm{CMB} = 1.95\,\mathrm{K} . \tag{2.25} $$
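
The counting of statistical weights behind eqs. (2.24) and (2.25) is easy to verify. The short sketch below uses exact fractions for the weights and the CMB temperature quoted earlier in the text; the variable names are illustrative.

```python
from fractions import Fraction

# Statistical weights per spin state, as stated in the text.
G_BOSON = Fraction(1)
G_FERMION = Fraction(7, 8)

# Before annihilation: electrons and positrons (2 spin states each)
# plus photons (2 polarisation states).
g_before = 4 * G_FERMION + 2 * G_BOSON
# After annihilation: photons only.
g_after = 2 * G_BOSON

# Entropy conservation with S proportional to g T^3 gives eq. (2.24).
temperature_ratio_cubed = g_before / g_after

T_CMB = 2.73  # K
T_nu = T_CMB * float(1 / temperature_ratio_cubed) ** (1.0 / 3.0)

print(f"g_before = {g_before}, g_after = {g_after}")
print(f"(T_after / T_before)^3 = {temperature_ratio_cubed}")
print(f"T_nu today = {T_nu:.2f} K")
```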

Although neutrinos have long been out of thermal equilibrium, their distribution
function remained unchanged since they decoupled, except that their temperature
gradually dropped in the course of cosmic expansion. Their energy density can thus
be computed from a Fermi-Dirac distribution with temperature $T_\nu$, and be converted
to the equivalent cosmic density parameter as for the photons. The result is

$$ \Omega_{\nu,0} = 2.8 \times 10^{-6}\,h^{-2} \tag{2.26} $$

per neutrino species.



Assuming three relativistic neutrino species, the total density parameter in relativistic
matter today is

$$ \Omega_\mathrm{R,0} = \Omega_\mathrm{CMB,0} + 3 \times \Omega_{\nu,0} = 3.2 \times 10^{-5}\,h^{-2} . \tag{2.27} $$

Since the energy density in relativistic matter is almost five orders of magnitude
less than the energy density of ordinary matter today if $\Omega_0$ is of order unity, the
expansion of the Universe today is matter-dominated, or $\rho = a^{-3}(t)\,\rho_0$. The energy
densities of ordinary and relativistic matter were equal when the scale factor $a(t)$
was

$$ a_\mathrm{eq} = \frac{\Omega_\mathrm{R,0}}{\Omega_0} = 3.2 \times 10^{-5}\,\Omega_0^{-1}\,h^{-2} , \tag{2.28} $$

and the expansion was radiation-dominated at yet earlier times, $\rho = a^{-4}\rho_0$. The
epoch of equality of matter and radiation density will turn out to be important for
the evolution of structure in the Universe discussed below.
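
Eqs. (2.27) and (2.28) amount to simple arithmetic on the quoted density parameters. The sketch below takes the values of eqs. (2.22) and (2.26) as inputs; the function names and the example parameters Ω_0 = 1, h = 0.5 are chosen here for illustration only.

```python
# Density parameters times h^2, taken from eqs. (2.22) and (2.26).
OMEGA_CMB_H2 = 2.4e-5
OMEGA_NU_H2 = 2.8e-6   # per neutrino species


def omega_relativistic_h2(n_species=3):
    """Total relativistic density parameter times h^2, eq. (2.27)."""
    return OMEGA_CMB_H2 + n_species * OMEGA_NU_H2


def scale_factor_at_equality(omega_0, h):
    """Scale factor at matter-radiation equality, eq. (2.28)."""
    return omega_relativistic_h2() / (omega_0 * h * h)


a_eq = scale_factor_at_equality(omega_0=1.0, h=0.5)
print(f"Omega_R h^2 = {omega_relativistic_h2():.2e}")
print(f"a_eq        = {a_eq:.2e}")
print(f"1 + z_eq    = {1.0 / a_eq:.0f}")
```

Lower values of Ω_0 h^2 shift equality to larger scale factors, that is, to later times.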



2.1.7 Spatial Curvature and Expansion 

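With the parameters defined previously, Friedmann's equation (2.11) can be written

$$ H^2(t) = H_0^2 \left[ a^{-4}(t)\,\Omega_\mathrm{R,0} + a^{-3}(t)\,\Omega_0 - a^{-2}(t)\,\frac{K c^2}{H_0^2} + \Omega_\Lambda \right] . \tag{2.29} $$

Since $H(t_0) \equiv H_0$, and $\Omega_\mathrm{R,0} \ll \Omega_0$, eq. (2.29) implies

$$ K = \left(\frac{H_0}{c}\right)^2 \left(\Omega_0 + \Omega_\Lambda - 1\right) , \tag{2.30} $$

and eq. (2.29) becomes

$$ H^2(t) = H_0^2 \left[ a^{-4}(t)\,\Omega_\mathrm{R,0} + a^{-3}(t)\,\Omega_0 + a^{-2}(t)\,(1 - \Omega_0 - \Omega_\Lambda) + \Omega_\Lambda \right] . \tag{2.31} $$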

The curvature of spatial hypersurfaces is therefore determined by the sum of the
density contributions from matter, $\Omega_0$, and from the cosmological constant, $\Omega_\Lambda$.
If $\Omega_0 + \Omega_\Lambda = 1$, space is flat, and it is closed or hyperbolic if $\Omega_0 + \Omega_\Lambda$ is larger
or smaller than unity, respectively. The spatial hypersurfaces of a low-density universe
are therefore hyperbolic, while those of a high-density universe are closed
[cf. eq. (2.4)]. A Friedmann-Lemaitre model universe is thus characterised by four
parameters: the expansion rate at present (or Hubble constant) $H_0$, and the density
parameters in matter, radiation, and the cosmological constant.

Dividing eq. (2.12) by eq. (2.11), using eq. (2.30), and setting $p = 0$, we obtain for
the deceleration parameter

\[ q_0 = \frac{\Omega_0}{2} - \Omega_\Lambda . \tag{2.32} \]

The age of the universe can be determined from eq. (2.31). Since
$\mathrm{d}t = \mathrm{d}a\,\dot a^{-1} = \mathrm{d}a\,(aH)^{-1}$, we have, ignoring $\Omega_{R,0}$,

\[ t_0 = \frac{1}{H_0} \int_0^1 \mathrm{d}a \left[ a^{-1}\Omega_0 + (1 - \Omega_0 - \Omega_\Lambda) + a^2\,\Omega_\Lambda \right]^{-1/2} . \tag{2.33} \]

It was assumed in this equation that $p = 0$ holds for all times $t$, while pressure is not
negligible at early times. The corresponding error, however, is very small because
the universe spends only a very short time in the radiation-dominated phase where
$p > 0$.

Figure 4 shows $t_0$ in units of $H_0^{-1}$ as a function of $\Omega_0$, for $\Omega_\Lambda = 0$ (solid curve) and
$\Omega_\Lambda = 1 - \Omega_0$ (dashed curve). The model universe is older for lower $\Omega_0$ and higher
$\Omega_\Lambda$ because the deceleration decreases with decreasing $\Omega_0$ and the acceleration
increases with increasing $\Omega_\Lambda$.

In principle, $\Omega_\Lambda$ can have either sign. We have restricted ourselves in Fig. 4 to
non-negative $\Omega_\Lambda$ because the cosmological constant is usually interpreted as the energy
density of the vacuum, which is positive semi-definite.
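The age integral (2.33) is easy to evaluate numerically. The sketch below uses a plain midpoint rule, which is an assumption of convenience rather than the method behind Fig. 4; the function name and the number of steps are likewise illustrative.

```python
import math


def age_in_hubble_times(omega0, omega_lambda, n=20000):
    """Return H0 * t0 from eq. (2.33), using the midpoint rule on 0 < a < 1."""
    da = 1.0 / n
    total = 0.0
    for i in range(n):
        a = (i + 0.5) * da  # midpoints never touch the endpoint a = 0
        bracket = omega0 / a + (1.0 - omega0 - omega_lambda) + a * a * omega_lambda
        total += da / math.sqrt(bracket)
    return total


if __name__ == "__main__":
    # Einstein-de Sitter, open low-density, and flat low-density models.
    for omega0, omega_lambda in [(1.0, 0.0), (0.3, 0.0), (0.3, 0.7)]:
        print(omega0, omega_lambda, age_in_hubble_times(omega0, omega_lambda))
```

For the Einstein-de Sitter case the integrand reduces to $a^{1/2}$, so the result should approach $2/3$, in agreement with the statement that lower $\Omega_0$ and higher $\Omega_\Lambda$ give an older universe.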

Fig. 4. Cosmic age $t_0$ in units of $H_0^{-1}$ as a function of $\Omega_0$, for $\Omega_\Lambda = 0$ (solid curve) and
$\Omega_\Lambda = 1 - \Omega_0$ (dashed curve).

The time evolution (2.31) of the Hubble function $H(t)$ allows one to determine the
dependence of $\Omega$ and $\Omega_\Lambda$ on the scale factor $a$. For a matter-dominated universe,
we find

\[
\begin{aligned}
\Omega(a) &= \frac{8\pi G}{3H^2(a)}\,\rho_0\,a^{-3} = \frac{\Omega_0}{a + \Omega_0 (1 - a) + \Omega_\Lambda (a^3 - a)} , \\
\Omega_\Lambda(a) &= \frac{\Lambda}{3H^2(a)} = \frac{\Omega_\Lambda\,a^3}{a + \Omega_0 (1 - a) + \Omega_\Lambda (a^3 - a)} .
\end{aligned}
\tag{2.34}
\]

These equations show that, whatever the values of $\Omega_0$ and $\Omega_\Lambda$ are at the present
epoch, $\Omega(a) \to 1$ and $\Omega_\Lambda(a) \to 0$ for $a \to 0$. This implies that for sufficiently early
times, all matter-dominated Friedmann-Lemaitre model universes can be described
by Einstein-de Sitter models, for which $K = 0$ and $\Omega_\Lambda = 0$. For $a \ll 1$, the right-hand
side of Friedmann's equation (2.31) is therefore dominated by the matter and
radiation terms because they contain the strongest dependences on $a^{-1}$. The Hubble
function $H(t)$ can then be approximated by



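\[ H(t) = H_0 \left[ \Omega_{R,0}\,a^{-4}(t) + \Omega_0\,a^{-3}(t) \right]^{1/2} . \tag{2.35} \]

Using the definition of $a_{\rm eq}$, $a_{\rm eq}^{-4}\,\Omega_{R,0} = a_{\rm eq}^{-3}\,\Omega_0$ [cf. eq. (2.28)], eq. (2.35) can be
written

\[ H(t) = H_0\,\Omega_0^{1/2}\,a^{-3/2} \left( 1 + \frac{a_{\rm eq}}{a} \right)^{1/2} . \tag{2.36} \]

Hence,

\[ H(t) = H_0\,\Omega_0^{1/2} \begin{cases} a_{\rm eq}^{1/2}\,a^{-2} & (a \ll a_{\rm eq}) \\ a^{-3/2} & (a_{\rm eq} \ll a \ll 1) \end{cases} . \tag{2.37} \]

Likewise, the expression for the cosmic time reduces to

\[ t(a) = \frac{2}{3H_0}\,\Omega_0^{-1/2} \left[ a^{3/2} \left( 1 - \frac{2a_{\rm eq}}{a} \right) \left( 1 + \frac{a_{\rm eq}}{a} \right)^{1/2} + 2a_{\rm eq}^{3/2} \right] , \tag{2.38} \]

or

\[ t = \frac{1}{H_0}\,\Omega_0^{-1/2} \begin{cases} \frac{1}{2}\,a_{\rm eq}^{-1/2}\,a^2 & (a \ll a_{\rm eq}) \\ \frac{2}{3}\,a^{3/2} & (a_{\rm eq} \ll a \ll 1) \end{cases} . \tag{2.39} \]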



Equation (2.36) is called the Einstein-de Sitter limit of Friedmann's equation.
Where not mentioned otherwise, we consider in the following only cosmic epochs
at times much later than $t_{\rm eq}$, i.e., when $a \gg a_{\rm eq}$, where the Universe is dominated
by dust, so that the pressure can be neglected, $p = 0$.
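The Einstein-de Sitter limit can be checked numerically. The sketch below codes eq. (2.36) for $H$, eq. (2.38) for the cosmic time, and the two limiting forms (2.39), all in units where $H_0 = 1$. The function names are illustrative, and $a_{\rm eq}$ is passed in as a parameter rather than derived from $\Omega_0$ and $h$.

```python
import math


def hubble_eds_limit(a, omega0, a_eq):
    """H(a) / H0 in the Einstein-de Sitter limit, eq. (2.36)."""
    return math.sqrt(omega0) * a**-1.5 * math.sqrt(1.0 + a_eq / a)


def cosmic_time(a, omega0, a_eq):
    """H0 * t(a) from eq. (2.38)."""
    bracket = (a**1.5 * (1.0 - 2.0 * a_eq / a) * math.sqrt(1.0 + a_eq / a)
               + 2.0 * a_eq**1.5)
    return 2.0 / (3.0 * math.sqrt(omega0)) * bracket


def cosmic_time_limits(a, omega0, a_eq):
    """Limiting forms of eq. (2.39): (radiation era, matter era)."""
    early = 0.5 * a * a / math.sqrt(a_eq * omega0)
    late = 2.0 / 3.0 * a**1.5 / math.sqrt(omega0)
    return early, late


if __name__ == "__main__":
    a_equality = 1.28e-4  # eq. (2.28) for Omega_0 = 1, h = 0.5
    for a in (1e-6, 1e-2, 1e-1):
        print(a, cosmic_time(a, 1.0, a_equality), cosmic_time_limits(a, 1.0, a_equality))
```

Since $\mathrm{d}t = \mathrm{d}a\,(aH)^{-1}$, the numerical derivative of `cosmic_time` times $aH$ should be unity, which is a convenient consistency check between eqs. (2.36) and (2.38).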



2.1.8 Necessity of a Big Bang 

Starting from $a = 1$ at the present epoch and integrating Friedmann's equation
(2.11) back in time shows that there are combinations of the cosmic parameters
such that $a > 0$ at all times. Such models would have no Big Bang. The necessity
of a Big Bang is usually inferred from the existence of the cosmic microwave
background, which is most naturally explained by an early, hot phase of the Universe.
Börner & Ehlers (1988) showed that two simple observational facts suffice
to show that the Universe must have gone through a Big Bang, if it is properly
described by the class of Friedmann-Lemaitre models. Indeed, the facts that there are
cosmological objects at redshifts $z > 4$, and that the cosmic density parameter of
non-relativistic matter, as inferred from observed galaxies and clusters of galaxies,
is $\Omega_0 > 0.02$, exclude models which have $a(t) > 0$ at all times. Therefore, if we
describe the Universe at large by Friedmann-Lemaitre models, we must assume a
Big Bang, or $a = 0$ at some time in the past.



2.1.9 Distances 

The meaning of "distance" is no longer unique in a curved space-time. In contrast 
to the situation in Euclidian space, distance definitions in terms of different mea- 
surement prescriptions lead to different distances. Distance measures are therefore 
defined in analogy to relations between measurable quantities in Euclidian space. 
We define here four different distance scales, the proper distance, the comoving 
distance, the angular-diameter distance, and the luminosity distance. 
Distance measures relate an emission event and an observation event on two separate
geodesic lines which fall on a common light cone, either the forward light
cone of the source or the backward light cone of the observer. They are therefore
characterised by the times $t_2$ and $t_1$ of emission and observation respectively, and
by the structure of the light cone. These times can uniquely be expressed by the
values $a_2 = a(t_2)$ and $a_1 = a(t_1)$ of the scale factor, or by the redshifts $z_2$ and $z_1$
corresponding to $a_2$ and $a_1$. We choose the latter parameterisation because redshifts
are directly observable. We also assume that the observer is at the origin of
the coordinate system.

The proper distance $D_{\rm prop}(z_1, z_2)$ is the distance measured by the travel time of
a light ray which propagates from a source at $z_2$ to an observer at $z_1 < z_2$. It is
defined by $\mathrm{d}D_{\rm prop} = -c\,\mathrm{d}t$, hence $\mathrm{d}D_{\rm prop} = -c\,\mathrm{d}a\,\dot a^{-1} = -c\,\mathrm{d}a\,(aH)^{-1}$. The minus
sign arises because, due to the choice of coordinates centred on the observer, distances
increase away from the observer, while the time $t$ and the scale factor $a$
increase towards the observer. We get

\[ D_{\rm prop}(z_1, z_2) = \frac{c}{H_0} \int_{a(z_2)}^{a(z_1)} \left[ a^{-1}\Omega_0 + (1 - \Omega_0 - \Omega_\Lambda) + a^2\,\Omega_\Lambda \right]^{-1/2} \mathrm{d}a . \tag{2.40} \]



The comoving distance $D_{\rm com}(z_1, z_2)$ is the distance on the spatial hypersurface $t = t_0$
between the worldlines of a source and an observer comoving with the cosmic flow.
Due to the choice of coordinates, it is the coordinate distance between a source at $z_2$
and an observer at $z_1$, $\mathrm{d}D_{\rm com} = \mathrm{d}w$. Since light rays propagate with $\mathrm{d}s = 0$, we have
$c\,\mathrm{d}t = -a\,\mathrm{d}w$ from the metric, and therefore
$\mathrm{d}D_{\rm com} = -a^{-1} c\,\mathrm{d}t = -c\,\mathrm{d}a\,(a\dot a)^{-1} = -c\,\mathrm{d}a\,(a^2 H)^{-1}$. Thus

\[ D_{\rm com}(z_1, z_2) = \frac{c}{H_0} \int_{a(z_2)}^{a(z_1)} \left[ a\,\Omega_0 + a^2 (1 - \Omega_0 - \Omega_\Lambda) + a^4\,\Omega_\Lambda \right]^{-1/2} \mathrm{d}a = w(z_1, z_2) . \tag{2.41} \]



The angular-diameter distance $D_{\rm ang}(z_1, z_2)$ is defined in analogy to the relation in
Euclidian space between the physical cross section $\delta A$ of an object at $z_2$ and the
solid angle $\delta\omega$ that it subtends for an observer at $z_1$, $\delta\omega\,D_{\rm ang}^2 = \delta A$. Hence,

\[ \frac{\delta A}{4\pi\,a^2(z_2)\,f_K^2[w(z_1, z_2)]} = \frac{\delta\omega}{4\pi} , \tag{2.42} \]

where $a(z_2)$ is the scale factor at emission time and $f_K[w(z_1, z_2)]$ is the radial coordinate
distance between the observer and the source. It follows

\[ D_{\rm ang}(z_1, z_2) = \left( \frac{\delta A}{\delta\omega} \right)^{1/2} = a(z_2)\,f_K[w(z_1, z_2)] . \tag{2.43} \]

According to the definition of the comoving distance, the angular-diameter distance
therefore is

\[ D_{\rm ang}(z_1, z_2) = a(z_2)\,f_K[D_{\rm com}(z_1, z_2)] . \tag{2.44} \]



The luminosity distance $D_{\rm lum}(z_1, z_2)$ is defined by the relation in Euclidian space
between the luminosity $L$ of an object at $z_2$ and the flux $S$ received by an observer
at $z_1$. It is related to the angular-diameter distance through

\[ D_{\rm lum}(z_1, z_2) = \left( \frac{a(z_1)}{a(z_2)} \right)^2 D_{\rm ang}(z_1, z_2) = \frac{a(z_1)^2}{a(z_2)}\,f_K[D_{\rm com}(z_1, z_2)] . \tag{2.45} \]

The first equality in (2.45), which is due to Etherington (1933), is valid in arbitrary
space-times. It is physically intuitive because photons are redshifted by
$a(z_1)\,a(z_2)^{-1}$, their arrival times are delayed by another factor $a(z_1)\,a(z_2)^{-1}$, and
the area of the observer's sphere on which the photons are distributed grows between
emission and absorption in proportion to $[a(z_1)\,a(z_2)^{-1}]^2$. This accounts for
a total factor of $[a(z_1)\,a(z_2)^{-1}]^4$ in the flux, and hence for a factor of $[a(z_1)\,a(z_2)^{-1}]^2$
in the distance relative to the angular-diameter distance.

We plot the four distances $D_{\rm prop}$, $D_{\rm com}$, $D_{\rm ang}$, and $D_{\rm lum}$ for $z_1 = 0$ as a function of $z$
in Fig. 5.

The distances are larger for lower cosmic density and higher cosmological constant.
Evidently, they differ by a large amount at high redshift. For small redshifts, $z \ll 1$,
they all follow the Hubble law,

\[ \mathrm{distance} = \frac{cz}{H_0} + O(z^2) . \tag{2.46} \]
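The four distance definitions, eqs. (2.40), (2.41), (2.44) and (2.45), can be evaluated numerically as sketched below. The midpoint integrator, the function names, and the choice of units ($h^{-1}$ Mpc, with $c/H_0 = 2997.92458\,h^{-1}$ Mpc) are illustrative assumptions. The form of $f_K$ is the standard one for constant-curvature spaces and is assumed here to match eq. (2.4), which lies outside this section.

```python
import math

C_OVER_H0 = 2997.92458  # Hubble radius c/H0 in h^-1 Mpc


def _integrate(f, lo, hi, n=20000):
    """Midpoint rule for the integral of f over [lo, hi]."""
    step = (hi - lo) / n
    return step * sum(f(lo + (i + 0.5) * step) for i in range(n))


def f_K(w, K):
    """Radial coordinate distance for spatial curvature K (assumed form of eq. 2.4)."""
    if K > 0:
        return math.sin(math.sqrt(K) * w) / math.sqrt(K)
    if K < 0:
        return math.sinh(math.sqrt(-K) * w) / math.sqrt(-K)
    return w


def distances(z1, z2, omega0, omega_lambda):
    """Return (D_prop, D_com, D_ang, D_lum) in h^-1 Mpc for z1 < z2."""
    a1, a2 = 1.0 / (1.0 + z1), 1.0 / (1.0 + z2)
    curv = 1.0 - omega0 - omega_lambda

    def prop_integrand(a):   # integrand of eq. (2.40)
        return (omega0 / a + curv + a * a * omega_lambda) ** -0.5

    def com_integrand(a):    # integrand of eq. (2.41)
        return (a * omega0 + a * a * curv + a**4 * omega_lambda) ** -0.5

    d_prop = C_OVER_H0 * _integrate(prop_integrand, a2, a1)
    d_com = C_OVER_H0 * _integrate(com_integrand, a2, a1)
    K = -curv / C_OVER_H0**2            # eq. (2.30), in (h^-1 Mpc)^-2
    d_ang = a2 * f_K(d_com, K)          # eq. (2.44)
    d_lum = (a1 / a2) ** 2 * d_ang      # eq. (2.45)
    return d_prop, d_com, d_ang, d_lum


if __name__ == "__main__":
    print(distances(0.0, 1.0, 1.0, 0.0))
    print(distances(0.0, 1.0, 0.3, 0.7))
```

For small source redshift all four values should approach $cz/H_0$, as stated by eq. (2.46).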



2.1.10 The Einstein-de Sitter Model 

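In order to illustrate some of the results obtained above, let us now specialise
to a model universe with a critical density of dust, $\Omega_0 = 1$ and $p = 0$, and
with zero cosmological constant, $\Omega_\Lambda = 0$. Friedmann's equation then reduces to
$H(t) = H_0\,a^{-3/2}$, and the age of the Universe becomes $t_0 = 2\,(3H_0)^{-1}$. The distance
measures are

\[
\begin{aligned}
D_{\rm prop}(z_1, z_2) &= \frac{2c}{3H_0} \left[ (1+z_1)^{-3/2} - (1+z_2)^{-3/2} \right] , \\
D_{\rm com}(z_1, z_2) &= \frac{2c}{H_0} \left[ (1+z_1)^{-1/2} - (1+z_2)^{-1/2} \right] , \\
D_{\rm ang}(z_1, z_2) &= \frac{2c}{H_0}\,\frac{1}{1+z_2} \left[ (1+z_1)^{-1/2} - (1+z_2)^{-1/2} \right] , \\
D_{\rm lum}(z_1, z_2) &= \frac{2c}{H_0}\,\frac{1+z_2}{(1+z_1)^2} \left[ (1+z_1)^{-1/2} - (1+z_2)^{-1/2} \right] .
\end{aligned}
\tag{2.47}
\]

Fig. 5. Four distance measures are plotted as a function of source redshift for two cosmological
models and an observer at redshift zero. These are the proper distance $D_{\rm prop}$ (1, solid
line), the comoving distance $D_{\rm com}$ (2, dotted line), the angular-diameter distance $D_{\rm ang}$ (3,
short-dashed line), and the luminosity distance $D_{\rm lum}$ (4, long-dashed line).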



2.2 Density Perturbations 



The standard model for the formation of structure in the Universe assumes that 
there were small fluctuations at some very early initial time, which grew by gravi- 
tational instability. Although the origin of the seed fluctuations is yet unclear, they 
possibly originated from quantum fluctuations in the very early Universe, which 
were blown up during a later inflationary phase. The fluctuations in this case are 
uncorrelated and the distribution of their amplitudes is Gaussian. Gravitational in- 
stability leads to a growth of the amplitudes of the relative density fluctuations. As 
long as the relative density contrast of the matter fluctuations is much smaller than
unity, they can be considered as small perturbations of the otherwise homogeneous
and isotropic background density, and linear perturbation theory suffices for their
description.

The linear theory of density perturbations in an expanding universe is generally
a complicated issue because it needs to be relativistic (e.g. Lifshitz 1946;
Bardeen 1980). The reason is that perturbations on any length scale are comparable
to or larger than the size of the horizon (see footnote) at sufficiently early times, and
then Newtonian theory ceases to be applicable. In other words, since the horizon
scale is comparable to the curvature radius of space-time, Newtonian theory
fails for larger-scale perturbations due to non-zero space-time curvature. The main
features can nevertheless be understood by fairly simple reasoning. We shall not
present a rigorous mathematical treatment here, but only quote the results which
are relevant for our later purposes. For a detailed qualitative and quantitative
treatment, we refer the reader to the excellent discussion in chapter 4 of the book by
Padmanabhan (1993).

2.2.1 Horizon Size 

The size of causally connected regions in the Universe is called the horizon size.
It is given by the distance by which a photon can travel in the time $t$ since the Big
Bang. Since the appropriate time scale is provided by the inverse Hubble parameter
$H^{-1}(a)$, the horizon size is $d'_H = c\,H^{-1}(a)$, and the comoving horizon size is

\[ d_H = \frac{c}{a\,H(a)} = \frac{c}{H_0}\,\Omega_0^{-1/2}\,a^{1/2} \left( 1 + \frac{a_{\rm eq}}{a} \right)^{-1/2} , \tag{2.48} \]

where we have inserted the Einstein-de Sitter limit (2.36) of Friedmann's equation.
The length $c\,H_0^{-1} = 3\,h^{-1}$ Gpc is called the Hubble radius. We shall see later that
the horizon size at $a_{\rm eq}$ plays a very important role for structure formation. Inserting
$a = a_{\rm eq}$ into eq. (2.48) yields

\[ d_H(a_{\rm eq}) = \frac{c}{\sqrt{2}\,H_0}\,\Omega_0^{-1/2}\,a_{\rm eq}^{1/2} \approx 12\,(\Omega_0 h^2)^{-1}\,\mathrm{Mpc} , \tag{2.49} \]

where $a_{\rm eq}$ from eq. (2.28) has been inserted.
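The numerical value in eq. (2.49) follows directly from eqs. (2.28) and (2.48). A minimal sketch, assuming $c/H_0 = 2997.92458\,h^{-1}$ Mpc and illustrative function names:

```python
import math

C_OVER_H0 = 2997.92458  # c/H0 in h^-1 Mpc


def comoving_horizon(a, omega0, h):
    """Comoving horizon size d_H(a) in Mpc, eq. (2.48)."""
    a_eq = 3.2e-5 / (omega0 * h * h)   # eq. (2.28)
    hubble_radius = C_OVER_H0 / h      # c/H0 in Mpc
    return hubble_radius / math.sqrt(omega0) * math.sqrt(a) / math.sqrt(1.0 + a_eq / a)


def horizon_at_equality(omega0, h):
    """d_H(a_eq) in Mpc, eq. (2.49)."""
    a_eq = 3.2e-5 / (omega0 * h * h)
    return comoving_horizon(a_eq, omega0, h)


if __name__ == "__main__":
    print(horizon_at_equality(1.0, 1.0))   # close to 12 Mpc for Omega_0 h^2 = 1
    print(horizon_at_equality(0.3, 0.7))
```

The result scales as $(\Omega_0 h^2)^{-1}$, so low-density models have a larger comoving horizon at equality.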

Footnote: In this context, the size of the horizon is the distance $ct$ by which light can travel in the
time $t$ since the Big Bang.

2.2.2 Linear Growth of Density Perturbations

We adopt the commonly held view that the density of the Universe is dominated
by weakly interacting dark matter at the relatively late times which are relevant for
weak gravitational lensing, $a \gg a_{\rm eq}$. Dark-matter perturbations are characterised by
the density contrast

\[ \delta(\vec x, a) = \frac{\rho(\vec x, a) - \bar\rho(a)}{\bar\rho(a)} , \tag{2.50} \]

where $\bar\rho = \rho_0\,a^{-3}$ is the average cosmic density. Relativistic and non-relativistic
perturbation theory shows that linear density fluctuations, i.e. perturbations with
$\delta \ll 1$, grow like

\[ \delta(a) \propto a^{n-2} = \begin{cases} a^2 & \mbox{before } a_{\rm eq} \\ a & \mbox{after } a_{\rm eq} \end{cases} \tag{2.51} \]

as long as the Einstein-de Sitter limit holds. For later times, $a \gg a_{\rm eq}$, when the
Einstein-de Sitter limit no longer applies if $\Omega_0 \ne 1$ or $\Omega_\Lambda \ne 0$, the linear growth of
density perturbations is changed according to

\[ \delta(a) = \delta_0\,a\,\frac{g'(a)}{g'(1)} \equiv \delta_0\,a\,g(a) , \tag{2.52} \]

where $\delta_0$ is the density contrast linearly extrapolated to the present epoch, and the
density-dependent growth function $g'(a)$ is accurately fit by (Carroll et al. 1992)

\[ g'(a; \Omega_0, \Omega_\Lambda) = \frac{5}{2}\,\Omega(a) \left[ \Omega^{4/7}(a) - \Omega_\Lambda(a) + \left( 1 + \frac{\Omega(a)}{2} \right) \left( 1 + \frac{\Omega_\Lambda(a)}{70} \right) \right]^{-1} . \tag{2.53} \]

The dependence of $\Omega$ and $\Omega_\Lambda$ on the scale factor $a$ is given in eqs. (2.34). The
growth function $a\,g(a; \Omega_0, \Omega_\Lambda)$ is shown in Fig. 6 for a variety of parameters $\Omega_0$
and $\Omega_\Lambda$.
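The growth function can be tabulated with a few lines of code. The sketch below combines the density parameters of eqs. (2.34) with the fitting formula of Carroll et al. (1992) in its commonly quoted form, which is assumed here to be eq. (2.53); the function names are illustrative.

```python
def omega_matter(a, omega0, omega_lambda):
    """Omega(a) from eq. (2.34)."""
    return omega0 / (a + omega0 * (1.0 - a) + omega_lambda * (a**3 - a))


def omega_lambda_of_a(a, omega0, omega_lambda):
    """Omega_Lambda(a) from eq. (2.34)."""
    return omega_lambda * a**3 / (a + omega0 * (1.0 - a) + omega_lambda * (a**3 - a))


def g_prime(a, omega0, omega_lambda):
    """Fitting formula for g'(a), assumed form of eq. (2.53)."""
    om = omega_matter(a, omega0, omega_lambda)
    ol = omega_lambda_of_a(a, omega0, omega_lambda)
    return 2.5 * om / (om ** (4.0 / 7.0) - ol + (1.0 + om / 2.0) * (1.0 + ol / 70.0))


def growth(a, omega0, omega_lambda):
    """Normalised growth function a * g(a) = a * g'(a) / g'(1), eq. (2.52)."""
    return a * g_prime(a, omega0, omega_lambda) / g_prime(1.0, omega0, omega_lambda)


if __name__ == "__main__":
    for a in (0.1, 0.25, 0.5, 1.0):
        print(a, growth(a, 1.0, 0.0), growth(a, 0.3, 0.0), growth(a, 0.3, 0.7))
```

For the Einstein-de Sitter model the formula gives $g' = 1$ exactly, so $a\,g(a) = a$. For low-density models $a\,g(a)$ exceeds $a$ at $a < 1$, which reflects that structure forms earlier in such models.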

The cosmic microwave background reveals relative temperature fluctuations of order
$10^{-5}$ on large scales. By the Sachs-Wolfe effect (Sachs & Wolfe 1967), these
temperature fluctuations reflect density fluctuations of the same order of magnitude.
The cosmic microwave background originated at $a \approx 10^{-3} \gg a_{\rm eq}$, well after the
Universe became matter-dominated. Since $\delta \propto a$ in that era, equation (2.51) then
implies that the density fluctuations today, expected from the temperature
fluctuations at $a \approx 10^{-3}$, should only reach a level of $10^{-2}$. Instead, structures
(e.g. galaxies) with $\delta \gg 1$ are observed. How can this discrepancy be resolved?
The cosmic microwave background displays fluctuations in the baryonic matter
component only. If there is an additional matter component that only couples through
weak interactions, fluctuations in that component could grow as soon as it decoupled
from the cosmic plasma, well before photons decoupled from baryons to set the cosmic
microwave background free. Such fluctuations could therefore easily reach the
amplitudes observed today, and thereby resolve the apparent mismatch between the
amplitudes of the temperature fluctuations in the cosmic microwave background and
the present cosmic structures. This is one of the strongest arguments for the
existence of a dark matter component in the Universe.

Fig. 6. The growth function $a\,g(a) \equiv a\,g'(a)/g'(1)$ given in eqs. (2.52) and (2.53) for $\Omega_0$
between 0.2 and 1.0 in steps of 0.2. Top panel: $\Omega_\Lambda = 0$; bottom panel: $\Omega_\Lambda = 1 - \Omega_0$. The
growth rate is constant for the Einstein-de Sitter model ($\Omega_0 = 1$, $\Omega_\Lambda = 0$), while it is higher
for $a \ll 1$ and lower for $a \approx 1$ for low-$\Omega_0$ models. Consequently, structure forms earlier in
low- than in high-$\Omega_0$ models.

2.2.3 Suppression of Growth 

It is convenient to decompose the density contrast $\delta$ into Fourier modes. In linear
perturbation theory, individual Fourier components evolve independently. A perturbation
of (comoving) wavelength $\lambda$ is said to "enter the horizon" when $\lambda = d_H(a)$.
If $\lambda < d_H(a_{\rm eq})$, the perturbation enters the horizon while radiation is still dominating
the expansion. Until $a_{\rm eq}$, the expansion time-scale, $t_{\rm exp} = H^{-1}$, is determined by
the radiation density $\rho_R$, and it is shorter than the collapse time-scale of the dark
matter, $t_{\rm DM}$:

\[ t_{\rm exp} \sim (G\rho_R)^{-1/2} < (G\rho_{\rm DM})^{-1/2} \sim t_{\rm DM} . \tag{2.54} \]

In other words, the fast radiation-driven expansion prevents dark-matter perturbations
from collapsing. Light can only cross regions that are smaller than the horizon
size. The suppression of growth due to radiation is therefore restricted to scales
smaller than the horizon, and larger-scale perturbations remain unaffected. This
explains why the horizon size at $a_{\rm eq}$, $d_H(a_{\rm eq})$, sets an important scale for structure
growth.




Fig. 7. Sketch illustrating the suppression of structure growth during the radiation-dominated
phase. The perturbation grows $\propto a^2$ before $a_{\rm eq}$, and $\propto a$ thereafter. If the
perturbation is smaller than the horizon at $a_{\rm eq}$, it enters the horizon at $a_{\rm enter} < a_{\rm eq}$ while
radiation is still dominating. The rapid radiation-driven expansion prevents the perturbation
from growing further. Hence it stalls until $a_{\rm eq}$. By then, its amplitude is smaller by
$f_{\rm sup} = (a_{\rm enter}/a_{\rm eq})^2$ than it would be without suppression.

Figure 7 illustrates the growth of a perturbation with $\lambda < d_H(a_{\rm eq})$, that is small
enough to enter the horizon at $a_{\rm enter} < a_{\rm eq}$. It can be read off from the figure that
such perturbations are suppressed by the factor

\[ f_{\rm sup} = \left( \frac{a_{\rm enter}}{a_{\rm eq}} \right)^2 . \tag{2.55} \]



It remains to be evaluated at what time $a_{\rm enter}$ a density perturbation with comoving
wavelength $\lambda$ enters the horizon. The condition is

\[ \lambda = d_H(a_{\rm enter}) = \frac{c}{a_{\rm enter}\,H(a_{\rm enter})} . \tag{2.56} \]

Well in the Einstein-de Sitter regime, the Hubble parameter is given by eq. (2.37).
Inserting that expression into (2.56) yields

\[ \lambda \propto \begin{cases} a_{\rm enter} & (a_{\rm enter} \ll a_{\rm eq}) \\ a_{\rm enter}^{1/2} & (a_{\rm eq} \ll a_{\rm enter} \ll 1) \end{cases} . \tag{2.57} \]

Let now $k = \lambda^{-1}$ be the wave number of the perturbation, and $k_0 = d_H^{-1}(a_{\rm eq})$ the
wave number corresponding to the horizon size at $a_{\rm eq}$. The suppression factor (2.55)
can then be written

\[ f_{\rm sup} = \left( \frac{k_0}{k} \right)^2 . \tag{2.58} \]

From eq. (2.49),

\[ k_0 \approx 0.083\,(\Omega_0 h^2)\,\mathrm{Mpc}^{-1} \approx 250\,(\Omega_0 h)\,(\mbox{Hubble radii})^{-1} . \tag{2.59} \]
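Equations (2.58) and (2.59) translate into a simple rule for the suppression of a given mode. The sketch below assumes a sharp transition at $k = k_0$, which is an idealisation of the sketch in Fig. 7, and uses illustrative function names.

```python
def k0_per_mpc(omega0, h):
    """Horizon wave number at equality in Mpc^-1, eq. (2.59)."""
    return 0.083 * omega0 * h * h


def suppression_factor(k, omega0, h):
    """f_sup of eq. (2.58) for k > k0; larger-scale modes are left unsuppressed."""
    k0 = k0_per_mpc(omega0, h)
    return (k0 / k) ** 2 if k > k0 else 1.0


if __name__ == "__main__":
    for k in (0.01, 0.1, 1.0, 10.0):   # wave numbers in Mpc^-1
        print(k, suppression_factor(k, 1.0, 0.5))
```

A mode ten times smaller than the horizon at equality is thus suppressed in amplitude by a factor of one hundred.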



2.2.4 Density Power Spectrum 

The assumed Gaussian density fluctuations $\delta(\vec x)$ at the comoving position $\vec x$ can
completely be characterised by their power spectrum $P_\delta(k)$, which can be defined
by (see Sect. 2.4)

\[ \left\langle \hat\delta(\vec k)\,\hat\delta^*(\vec k') \right\rangle = (2\pi)^3\,\delta_D(\vec k - \vec k')\,P_\delta(k) , \tag{2.60} \]

where $\hat\delta(\vec k)$ is the Fourier transform of $\delta$, and the asterisk denotes complex
conjugation. Strictly speaking, the Fourier decomposition is valid only in flat space.
However, at early times space is flat in any cosmological model, and at late times
the interesting scales $k^{-1}$ of the density perturbations are much smaller than the
curvature radius of the Universe. Hence, we can apply Fourier decomposition here.

Consider now the primordial perturbation spectrum at some very early time, $P_i(k) =
|\delta_i^2(k)|$. Since the density contrast grows as $\delta \propto a^{n-2}$ [eq. (2.51)], the spectrum
grows as $P_\delta(k) \propto a^{2(n-2)}$. At $a_{\rm enter}$, the spectrum has therefore changed to

\[ P_{\rm enter}(k) \propto a_{\rm enter}^{2(n-2)}\,P_i(k) \propto k^{-4}\,P_i(k) , \tag{2.61} \]

where eq. (2.57) was used for $k \gg k_0$.

It is commonly assumed that the total power of the density fluctuations at $a_{\rm enter}$
should be scale-invariant. This implies $k^3 P_{\rm enter}(k) = \mathrm{const.}$, or $P_{\rm enter}(k) \propto k^{-3}$.
Accordingly, the primordial spectrum has to scale with $k$ as $P_i(k) \propto k$. This
scale-invariant spectrum is called the Harrison-Zel'dovich spectrum (Harrison 1970;
Peebles & Yu 1970; Zel'dovich 1972). Combining that with the suppression of
small-scale modes (2.58), we arrive at

\[ P_\delta(k) \propto \begin{cases} k & \mbox{for } k \ll k_0 \\ k^{-3} & \mbox{for } k \gg k_0 \end{cases} . \tag{2.62} \]



An additional complication arises when the dark matter consists of particles moving 
with a velocity comparable to the speed of light. In order to keep them gravitation- 
ally bound, density perturbations then have to have a certain minimum mass, or 
equivalently a certain minimum size. All perturbations smaller than that size are 
damped away by free streaming of particles. Consequently, the density perturba- 
tion spectrum of such particles has an exponential cut-off at large k. This clarifies 
the distinction between hot and cold dark matter: Hot dark matter (HDM) consists 
of fast particles that damp away small-scale perturbations, while cold dark matter 
(CDM) particles are slow enough to cause no significant damping. 
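The asymptotic shape of the cold-dark-matter spectrum derived above can be reproduced by multiplying a Harrison-Zel'dovich spectrum, $P_i \propto k$, by the square of the amplitude suppression factor (2.58). The sketch below is a toy model with a sharp break at $k_0$ and arbitrary normalisation; it is not the transfer function used for Fig. 8, and the function names are illustrative.

```python
import math


def power_spectrum_shape(k, k0, amplitude=1.0):
    """Toy spectrum: P_i ~ k times f_sup^2, with f_sup = (k0/k)^2 for k > k0."""
    f_sup = (k0 / k) ** 2 if k > k0 else 1.0
    return amplitude * k * f_sup ** 2


def log_slope(k, k0, eps=1e-3):
    """Numerical logarithmic slope d ln P / d ln k at wave number k."""
    hi = power_spectrum_shape(k * (1 + eps), k0)
    lo = power_spectrum_shape(k * (1 - eps), k0)
    return (math.log(hi) - math.log(lo)) / (math.log(1 + eps) - math.log(1 - eps))


if __name__ == "__main__":
    k0 = 0.083  # Mpc^-1 for Omega_0 h^2 = 1, eq. (2.59)
    for k in (1e-3, 1e-2, 1.0, 10.0):
        print(k, power_spectrum_shape(k, k0), log_slope(k, k0))
```

The squared factor appears because the power spectrum is quadratic in the density contrast, so an amplitude suppression $\propto k^{-2}$ turns the primordial slope of $+1$ into $-3$ at large $k$.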

2.2.5 Normalisation of the Power Spectrum 

Apart from the shape of the power spectrum, its normalisation has to be fixed. 
Several methods are available which usually yield different answers: 

(1) Normalisation by microwave-background anisotropies: The COBE satellite
has measured fluctuations in the temperature of the microwave sky at the rms
level of $\Delta T/T \sim 1.3 \times 10^{-5}$ at an angular scale of $\sim 7^\circ$ (Banday et al. 1997).
Adopting a shape for the power spectrum, these fluctuations can be translated
into an amplitude for $P_\delta(k)$. Due to the large angular scale of the measurement,
this kind of amplitude determination specifies the amplitude on large physical
scales (small $k$) only. In addition, microwave-background fluctuations measure
the amplitude of scalar and tensor perturbation modes, while the growth
of density fluctuations is determined by the fluctuation amplitude of scalar
modes only.

(2) Normalisation by the local variance of galaxy counts, pioneered by
Davis & Peebles (1983): Galaxies are supposed to be biased tracers
of underlying dark-matter fluctuations (Kaiser 1984; Bardeen et al. 1986;
White et al. 1987). By measuring the local variance of galaxy counts within
certain volumes, and assuming an expression for the bias, the amplitude
of dark-matter fluctuations can be inferred. Conventionally, the variance of
galaxy counts $\sigma_{8,\rm galaxies}$ is measured within spheres of radius $8\,h^{-1}$ Mpc, and
the result is $\sigma_{8,\rm galaxies} \approx 1$. The problem of finding the corresponding variance
$\sigma_8$ of matter-density fluctuations is that the exact bias mechanism of galaxy
formation is still under debate (e.g. Kauffmann et al. 1997).
(3) Normalisation by the local abundance of galaxy clusters (White et al. 1993; 
Eke et al. 1996; Viana & Liddle 1996): Galaxy clusters form by gravitational 
instability from dark-matter density perturbations. Their spatial number den- 
sity reflects the amplitude of appropriate dark-matter fluctuations in a very 
sensitive manner. It is therefore possible to determine the amplitude of the 
power spectrum by demanding that the local spatial number density of galaxy 
clusters be reproduced. Typical scales for dark-matter fluctuations collapsing 
to galaxy clusters are of order 10 Mpc, hence cluster normalisation deter- 
mines the amplitude of the power spectrum on just that scale. 

Since gravitational lensing by large-scale structures is generally sensitive to scales
comparable to $k_0^{-1} \sim 12\,(\Omega_0 h^2)^{-1}$ Mpc, cluster normalisation appears to be the most
appropriate normalisation method for the present purposes. The solid curve in Fig. 8
shows the CDM power spectrum, linearly evolved to $z = 0$ (or $a = 1$) in an
Einstein-de Sitter universe with $h = 0.5$, normalised to the local cluster
abundance.

Fig. 8. CDM power spectrum, normalised to the local abundance of galaxy clusters, for
an Einstein-de Sitter universe with $h = 0.5$. Two curves are displayed. The solid curve
shows the linear, the dashed curve the non-linear power spectrum. While the linear power
spectrum asymptotically falls off $\propto k^{-3}$, the non-linear power spectrum, according to
Peacock & Dodds (1996), illustrates the increased power on small scales due to non-linear
effects, at the expense of larger-scale structures.

2.2.6 Non-Linear Evolution



At late stages of the evolution and on small scales, the growth of density fluctuations
begins to depart from the linear behaviour of eq. (2.52). Density fluctuations
grow non-linearly, and fluctuations of different size interact. Generally, the
evolution of $P(k)$ then becomes complicated and needs to be evaluated numerically.
However, starting from the bold ansatz that the two-point correlation functions
in the linear and non-linear regimes are related by a general scaling relation
(Hamilton et al. 1991), which turns out to hold remarkably well, analytic formulae
describing the non-linear behaviour of $P(k)$ have been derived (Jain et al. 1995;
Peacock & Dodds 1996). It will turn out in subsequent chapters that the non-linear
evolution of the density fluctuations is crucial for accurately calculating
weak-lensing effects by large-scale structures. As an example, we show as the dashed
curve in Fig. 8 the CDM power spectrum in an Einstein-de Sitter universe with
$h = 0.5$, normalised to the local cluster abundance, non-linearly evolved to $z = 0$.
The non-linear effects are immediately apparent: While the spectrum remains unchanged
for large scales ($k \ll k_0$), the amplitude on small scales ($k \gg k_0$) is substantially
increased at the expense of scales just above the peak. It should be noted
that non-linearly evolved density fluctuations are no longer fully characterised by
the power spectrum only, because then non-Gaussian features develop.



2.2.7 Poisson's Equation

Localised density perturbations which are much smaller than the horizon and whose
peculiar velocities relative to the mean motion in the Universe are much smaller
than the speed of light, can be described by Newtonian gravity. Their gravitational
potential obeys Poisson's equation,

\[ \nabla_r^2 \Phi' = 4\pi G \rho , \tag{2.63} \]

where $\rho = (1 + \delta)\,\bar\rho$ is the total matter density, and $\Phi'$ is the sum of the potentials
of the smooth background $\bar\Phi$ and the potential of the perturbation $\Phi$. The gradient
$\nabla_r$ operates with respect to the physical, or proper, coordinates. Since Poisson's
equation is linear, we can subtract the background contribution $\nabla_r^2 \bar\Phi = 4\pi G \bar\rho$.
Introducing the gradient with respect to comoving coordinates $\nabla_x = a \nabla_r$, we can write
eq. (2.63) in the form

\[ \nabla_x^2 \Phi = 4\pi G\,a^2\,\bar\rho\,\delta . \tag{2.64} \]

In the matter-dominated epoch, $\bar\rho = a^{-3}\,\bar\rho_0$. With the critical density (2.15), Poisson's
equation can be re-written as

\[ \nabla_x^2 \Phi = \frac{3H_0^2}{2a}\,\Omega_0\,\delta . \tag{2.65} \]
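The step from eq. (2.64) to eq. (2.65) can be spelled out as follows. The only ingredient not shown in this section is the critical density of eq. (2.15), which is assumed here to have the usual form $\rho_{\rm cr} = 3H_0^2/(8\pi G)$, so that $\bar\rho_0 = \Omega_0\,\rho_{\rm cr}$:

```latex
\begin{aligned}
\nabla_x^2 \Phi
  &= 4\pi G\,a^2\,\bar\rho\,\delta
   = 4\pi G\,a^2 \left( a^{-3}\,\Omega_0\,\rho_{\rm cr} \right) \delta \\
  &= 4\pi G\,a^{-1}\,\Omega_0\,\frac{3H_0^2}{8\pi G}\,\delta
   = \frac{3H_0^2}{2a}\,\Omega_0\,\delta .
\end{aligned}
```

Newton's constant cancels, so the potential of a given density contrast is fixed by $H_0$, $\Omega_0$ and the scale factor alone.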
2.3 Relevant Properties of Lenses and Sources

Individual reviews have been written on galaxies (e.g. Faber & Gallagher 1979;
Binggeli et al. 1988; Giovanelli & Haynes 1991; Koo & Kron 1992; Ellis 1997),
clusters of galaxies (e.g. Bahcall 1977; Rood 1981; Forman & Jones 1982;
Bahcall 1988; Sarazin 1986), and active galactic nuclei (e.g. Rees 1984;
Weedman 1986; Blandford et al. 1990; Hartwick & Schade 1990;
Warren & Hewett 1990; Antonucci 1993; Peterson 1997). A detailed presentation
of these objects is not the purpose of this review. It suffices here to
summarise those properties of these objects that are relevant for understanding
the following discussion. It is not necessary to know the properties and
peculiarities of individual objects; rather, we need to specify the objects statistically.
This section will therefore focus on a statistical description, leaving subtleties aside.



2.3.1 Galaxies 

For the purposes of this review, we need to characterise the statistical proper- 
ties of galaxies as a class. Galaxies can broadly be grouped into two popula- 
tions, dubbed early-type and late-type galaxies, or ellipticals and spirals, respec- 
tively. While spiral galaxies include disks structured by more or less pronounced 
spiral arms, and approximately spherical bulges centred on the disk centre, el- 
liptical galaxies exhibit amorphous projected light distributions with roughly el- 
liptical isophotes. There are, of course, more elaborate morphological classifica- 
tion schemes (e.g. de Vaucouleurs et al. 1991; Buta et al. 1994; Naim et al. 1995a; 
Naim et al. 1995b), but the broad distinction between ellipticals and spirals suffices 
for this review. 

Outside galaxy clusters, the galaxy population consists of about 3/4 spiral galaxies
and 1/4 elliptical galaxies, while the fraction of ellipticals increases towards cluster
centres. Elliptical galaxies are typically more massive than spirals. They contain
little gas, and their stellar population is older, and thus 'redder', than in spiral galaxies.
In spirals, there is a substantial amount of gas in the disk, providing the material
for ongoing formation of new stars. Likewise, there is little dust in ellipticals, but
possibly large amounts of dust are associated with the gas in spirals.

Massive galaxies have of order $10^{11}$ solar masses, or $2 \times 10^{44}$ g within their visible
radius. Such galaxies have luminosities of order $10^{10}$ times the solar luminosity.
The kinematics of the stars, gas and molecular clouds in galaxies, as revealed by
spectroscopy, indicate that there is a relation between the characteristic velocities
inside galaxies and their luminosity (Faber & Jackson 1976; Tully & Fisher 1977);
brighter galaxies tend to have larger masses.

The differential luminosity distribution of galaxies can very well be described by
the functional form

\[ \Phi(L)\,\frac{\mathrm{d}L}{L_*} = \Phi_* \left( \frac{L}{L_*} \right)^{-\nu} \exp\left( -\frac{L}{L_*} \right) \frac{\mathrm{d}L}{L_*} , \tag{2.66} \]

proposed by Schechter (1976). The parameters have been measured to be

\[ \nu \approx 1.1 , \quad L_* \approx 1.1 \times 10^{10}\,L_\odot , \quad \Phi_* \approx 1.5 \times 10^{-2}\,h^3\,\mathrm{Mpc}^{-3} \tag{2.67} \]

(e.g. Efstathiou et al. 1988; Marzke et al. 1994a; Marzke et al. 1994b). This distribution
means that there is essentially a sharp cut-off in the galaxy population above
luminosities of $\sim L_*$, and the mean separation between $L_*$-galaxies is of order
$\sim \Phi_*^{-1/3} \approx 4\,h^{-1}$ Mpc.

The stars in elliptical galaxies have randomly oriented orbits, while by far most
stars in spirals have orbits roughly coplanar with the galactic disks. Stellar velocities
are therefore characterised by a velocity dispersion $\sigma_v$ in ellipticals, and by
an asymptotic circular velocity $v_c$ in spirals (see footnote). These characteristic
velocities are related to galaxy luminosities by laws of the form

\[ \frac{\sigma_v}{\sigma_{v,*}} = \left( \frac{L}{L_*} \right)^{1/\alpha} ; \quad \frac{v_c}{v_{c,*}} = \left( \frac{L}{L_*} \right)^{1/\alpha} , \tag{2.68} \]

where $\alpha$ ranges around 3-4. For spirals, eq. (2.68) is called the Tully-Fisher
relation (Tully & Fisher 1977), for ellipticals the Faber-Jackson relation
(Faber & Jackson 1976). Both velocity scales $\sigma_{v,*}$ and $v_{c,*}$ are of order
220 km s$^{-1}$. Since $v_c = \sqrt{2}\,\sigma_v$, ellipticals with the same luminosity are more
massive than spirals.

Footnote: The circular velocity of stars and gas in spiral galaxies turns out to be fairly independent
of radius, except close to their centre. These flat rotation curves cannot be caused by the
observable matter in these galaxies, but provide strong evidence for the presence of a dark
halo, with density profile $\rho \propto r^{-2}$ at large radii.

Most relevant for weak gravitational lensing is a population of faint galaxies emitting
bluer light than local galaxies, the so-called faint blue galaxies (Tyson 1988;
see Ellis 1997 for a review). There are of order 30-50 such galaxies per square
arc minute on the sky which can be mapped with current ground-based optical telescopes,
i.e. there are $\approx 20{,}000$-$40{,}000$ such galaxies on the area of the full moon.
The picture that the sky is covered with a 'wall paper' of those faint and presumably
distant blue galaxies is therefore justified. It is this fine-grained pattern on the sky
that makes many weak-lensing studies possible in the first place, because it allows
the detection of the coherent distortions imprinted by gravitational lensing on the
images of the faint blue galaxy population.

Due to their faintness, redshifts of the faint blue galaxies are hard to measure
spectroscopically. The following picture, however, seems to be reasonably
secure. It has emerged from increasingly deep and detailed observations
(see, e.g. Broadhurst et al. 1988; Colless et al. 1991; Colless et al. 1993;
Lilly et al. 1991; Lilly 1993; Crampton et al. 1995; and also the reviews by
Koo & Kron 1992 and Ellis 1997). The redshift distribution of faint galaxies has
been found to agree fairly well with that expected for a non-evolving comoving 
number density. While the galaxy number counts in blue light are substantially 
above an extrapolation of the local counts down to increasingly faint magnitudes, 
those in the red spectral bands agree fairly well with extrapolations from local num- 
ber densities. Further, while there is significant evolution of the luminosity function 
in the blue, in that the luminosity scale $L_*$ of a Schechter-type fit increases with
redshift, the luminosity function of the galaxies in the red shows little sign of
evolution. Highly resolved images of faint blue galaxies obtained with the Hubble Space
Telescope are now becoming available. In red light, they reveal mostly ordinary 
spiral galaxies, while their substantial emission in blue light is more concentrated 
to either spiral arms or bulges. Spectra exhibit emission lines characteristic of star 
formation. 

These findings support the view that the galaxy evolution towards higher redshifts 
apparent in blue light results from enhanced star-formation activity taking place 
in a population of galaxies which, apart from that, may remain unchanged even 
out to redshifts of $z \gtrsim 1$. The redshift distribution of the faint blue galaxies is then
sufficiently well described by

$$p_0(z)\,dz = \frac{\beta}{z_0^3\,\Gamma(3/\beta)}\,z^2\,\exp\left[-\left(\frac{z}{z_0}\right)^{\beta}\right]dz\;. \tag{2.69}$$

This expression is normalised to $0 \le z < \infty$ and provides a good fit to the observed
redshift distribution (e.g. Small et al. 1995b). The mean redshift $\langle z\rangle$ is proportional
to $z_0$, and the parameter $\beta$ describes how steeply the distribution falls off beyond
$z_0$. For $\beta = 1.5$, $\langle z\rangle \approx 1.5\,z_0$. The parameter $z_0$ depends on the magnitude cutoff and
the colour selection of the galaxy sample.
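
The relation between the mean redshift and $z_0$ can be checked numerically. The following minimal sketch assumes the distribution has the form $p_0(z) \propto z^2 \exp[-(z/z_0)^\beta]$, for which $\langle z\rangle = z_0\,\Gamma(4/\beta)/\Gamma(3/\beta)$; the function names and the value of $z_0$ are illustrative choices, not taken from the text.

```python
import math

def redshift_pdf(z, z0, beta):
    """Assumed form: p(z) = beta / (z0^3 Gamma(3/beta)) * z^2 * exp(-(z/z0)^beta)."""
    norm = beta / (z0**3 * math.gamma(3.0 / beta))
    return norm * z**2 * math.exp(-((z / z0) ** beta))

def mean_redshift(z0, beta):
    """Analytic mean of the assumed distribution: z0 * Gamma(4/beta) / Gamma(3/beta)."""
    return z0 * math.gamma(4.0 / beta) / math.gamma(3.0 / beta)

def integrate(f, a, b, n=20000):
    """Composite trapezoidal rule on [a, b] with n intervals."""
    h = (b - a) / n
    total = 0.5 * (f(a) + f(b))
    for i in range(1, n):
        total += f(a + i * h)
    return total * h

z0, beta = 0.7, 1.5  # z0 is an illustrative value only
z_max = 20.0 * z0    # the integrand is negligible beyond this point
norm_check = integrate(lambda z: redshift_pdf(z, z0, beta), 0.0, z_max)
mean_numeric = integrate(lambda z: z * redshift_pdf(z, z0, beta), 0.0, z_max)
print(norm_check, mean_numeric / z0, mean_redshift(z0, beta) / z0)
```

Under these assumptions, the numerical and analytic means agree, and their ratio to $z_0$ is close to 1.5 for $\beta = 1.5$, as stated above.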

Background galaxies would be ideal tracers of distortions caused by gravitational 
lensing if they were intrinsically circular. Then, any measured ellipticity would di- 
rectly reflect the action of the gravitational tidal field of the lenses. Unfortunately, 
this is not the case. To first approximation, galaxies have intrinsically elliptical 
shapes, but the ellipses are randomly oriented. The intrinsic ellipticities introduce 
noise into the inference of the tidal field from observed ellipticities, and it is impor- 
tant for the quantification of the noise to know the intrinsic ellipticity distribution. 
Let $|\epsilon|$ be the ellipticity of a galaxy image, defined such that for an ellipse with axes
$a$ and $b < a$,

$$|\epsilon| \equiv \frac{a-b}{a+b}\;. \tag{2.70}$$

Ellipses have an orientation, hence the ellipticity has two components $\epsilon_{1,2}$, with
$|\epsilon| = (\epsilon_1^2 + \epsilon_2^2)^{1/2}$. It turns out empirically that a Gaussian is a good description for
the ellipticity distribution,

$$p_\epsilon(\epsilon_1,\epsilon_2)\,d\epsilon_1\,d\epsilon_2 = \frac{\exp(-|\epsilon|^2/\sigma_\epsilon^2)}{\pi\sigma_\epsilon^2\left[1-\exp(-1/\sigma_\epsilon^2)\right]}\,d\epsilon_1\,d\epsilon_2\;, \tag{2.71}$$

with a characteristic width of $\sigma_\epsilon \approx 0.2$ (e.g. Miralda-Escudé 1991b;
Tyson & Seitzer 1988; Brainerd et al. 1996). We will later (Sect. 4.2) define 
galaxy ellipticities for the general situation where the isophotes are not ellipses. 
This completes our summary of galaxy properties as required here. 
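
To illustrate the role of the intrinsic ellipticities as a noise source, the following sketch draws ellipticities from a Gaussian truncated at $|\epsilon| = 1$, with normalisation $\pi\sigma_\epsilon^2[1-\exp(-1/\sigma_\epsilon^2)]$ and random orientations. The sampling scheme (inverting the radial distribution) and all function names are our own assumptions, chosen for illustration.

```python
import math
import random

def ellipticity_pdf(e1, e2, sigma):
    """Truncated Gaussian in (e1, e2), defined on the unit disk |e| <= 1."""
    e_sq = e1 * e1 + e2 * e2
    if e_sq > 1.0:
        return 0.0
    norm = math.pi * sigma**2 * (1.0 - math.exp(-1.0 / sigma**2))
    return math.exp(-e_sq / sigma**2) / norm

def sample_ellipticity(sigma, rng):
    """Draw (e1, e2): modulus by inverting the radial CDF, orientation uniform."""
    u = rng.random()
    truncation = 1.0 - math.exp(-1.0 / sigma**2)
    modulus = sigma * math.sqrt(-math.log(1.0 - u * truncation))
    phase = rng.uniform(0.0, 2.0 * math.pi)
    return modulus * math.cos(phase), modulus * math.sin(phase)

sigma_e = 0.2
rng = random.Random(42)
samples = [sample_ellipticity(sigma_e, rng) for _ in range(100_000)]
mean_e1 = sum(e1 for e1, _ in samples) / len(samples)
mean_sq = sum(e1 * e1 + e2 * e2 for e1, e2 in samples) / len(samples)
print(mean_e1, math.sqrt(mean_sq))
```

Because the orientations are random, the mean of each component is consistent with zero, while the root-mean-square modulus stays close to $\sigma_\epsilon$; averaging over $N$ galaxies therefore reduces the ellipticity noise as $N^{-1/2}$.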



2.3.2 Groups and Clusters of Galaxies 

Galaxies are not randomly distributed in the sky. Their positions are correlated, and 
there are areas in the sky where the galaxy density is noticeably higher or lower 
than average (cf. the galaxy count map in Fig. 9). There are groups consisting of 
a few galaxies, and there are clusters of galaxies in which some hundred up to a 
thousand galaxies appear very close together. 




Fig. 9. The Lick galaxy counts within 50° radius around the North Galactic pole 
(Seldner et al. 1977). The galaxy number density is highest at the black and lowest at the 
white regions on the map. The picture illustrates structure in the distribution of fairly nearby 
galaxies, viz. under-dense regions, long extended filaments, and clusters of galaxies. 

The most prominent galaxy cluster in the sky covers a huge area centred on the 
Virgo constellation. Its central region has a diameter of about 7°, and its main body 
extends over roughly 15° × 40°. It was already noted by Sir William Herschel in
the 18th century that the entire Virgo cluster covers about 1/8th of the sky, while
containing about 1/3rd of the galaxies observable at that time.
Zwicky noted in 1933 that the galaxies in the Coma cluster and other rich clusters move so
fast that the clusters required about ten to 100 times more mass to keep the galaxies 
bound than could be accounted for by the luminous galaxies themselves. This was 
the earliest indication that there is invisible mass, or dark matter, in at least some 
objects in the Universe. 

Several thousand galaxy clusters are known today. Abell's (1958) cluster catalog
lists 2712 clusters north of $-20^\circ$ declination and away from the Galactic
plane. Employing a less restrictive definition of galaxy clusters, the catalog
by Zwicky et al. (1968) identifies 9134 clusters north of $-3^\circ$ declination. Cluster
masses can exceed $10^{48}$ g or $5\times10^{14}\,M_\odot$, and they have typical radii of
$\approx 5\times10^{24}$ cm or 1.5 Mpc.




Fig. 10. The galaxy cluster Abell 370, in which the first gravitationally lensed arc was
detected (Lynds & Petrosian 1986; Soucail et al. 1987a, 1987b). Most of the bright galaxies
seen are cluster members at z = 0.37, whereas the arc, i.e. the highly elongated feature, is
the image of a galaxy at redshift z = 0.724 (Soucail et al. 1988).

When X-ray telescopes became available after 1966, it was discovered that clusters
are powerful X-ray emitters. Their X-ray luminosities fall within
$(10^{43} - 10^{45})\,\mathrm{erg\,s^{-1}}$, rendering galaxy clusters the most luminous X-ray sources in the
sky. Improved X-ray telescopes revealed that the source of X-ray emission in clusters
is extended rather than point-like, and that the X-ray spectra are best explained
by thermal bremsstrahlung (free-free radiation) from a hot, dilute plasma with temperatures
in the range $(10^7 - 10^8)$ K and densities of $\sim 10^{-3}$ particles per cm$^3$.
Based on the assumption that this intra-cluster gas is in hydrostatic equilibrium 
with a spherically symmetric gravitational potential of the total cluster matter, the
X-ray temperature and flux can be used to estimate the cluster mass. Typical re- 
sults approximately (i.e. up to a factor of ~ 2) agree with the mass estimates from 
the kinematics of cluster galaxies employing the virial theorem. The mass of the 
intra-cluster gas amounts to about 10% of the total cluster mass. The X-ray emis- 
sion thus independently confirms the existence of dark matter in galaxy clusters. 
Sarazin (1986) reviews clusters of galaxies focusing on their X-ray emission. 
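
As a rough illustration of such an X-ray mass estimate, the sketch below evaluates the standard hydrostatic relation for a spherically symmetric gas, $M(<r) = -\frac{kTr}{G\mu m_\mathrm{p}}\left(\frac{d\ln\rho_\mathrm{gas}}{d\ln r} + \frac{d\ln T}{d\ln r}\right)$. The isothermal gas with $\rho_\mathrm{gas} \propto r^{-2}$, the mean molecular weight $\mu = 0.6$, and the chosen radius and temperature are assumptions made here for illustration only.

```python
G = 6.674e-8          # gravitational constant [cm^3 g^-1 s^-2]
K_B = 1.381e-16       # Boltzmann constant [erg K^-1]
M_PROTON = 1.673e-24  # proton mass [g]
M_SUN = 1.989e33      # solar mass [g]
MPC = 3.086e24        # megaparsec [cm]

def hydrostatic_mass(radius_cm, temperature_k, dlnrho_dlnr, dlnT_dlnr=0.0, mu=0.6):
    """Mass enclosed within radius r for gas in hydrostatic equilibrium
    in a spherically symmetric potential (cgs units)."""
    slope = dlnrho_dlnr + dlnT_dlnr
    return -K_B * temperature_k * radius_cm * slope / (G * mu * M_PROTON)

# Illustrative inputs: T = 1e8 K at r = 1.5 Mpc, isothermal gas with rho ~ r^-2.
mass_g = hydrostatic_mass(1.5 * MPC, 1.0e8, dlnrho_dlnr=-2.0)
print(f"{mass_g:.2e} g = {mass_g / M_SUN:.2e} M_sun")
```

With these inputs the estimate comes out at order $10^{48}$ g, i.e. roughly $10^{15}\,M_\odot$, which is comparable to the cluster masses quoted above.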

Later, luminous arc-like features were discovered in two galaxy clusters 
(Lynds & Petrosian 1986; Soucail et al. 1987a, 1987b; see Fig. 10). Their light is
typically bluer than that from the cluster galaxies, and their length is comparable to
the size of the central cluster region. Paczyński (1987) suggested that these arcs are
images of galaxies in the background of the clusters which are strongly distorted by 
the gravitational tidal field close to the cluster centres. This explanation was gen- 
erally accepted when spectroscopy revealed that the sources of the arcs are much 
more distant than the clusters in which they appear (Soucail et al. 1988). 

Large arcs require special alignment of the arc source with the lensing clus- 
ter. At larger distance from the cluster centre, images of background galaxies 
are only weakly deformed, and they are referred to as arclets (Fort et al. 1988; 
Tyson et al. 1990). The high number density of faint arclets allows one to mea- 
sure the coherent distortion caused by the tidal gravitational field of the cluster out 
to fairly large radii. One of the main applications of weak gravitational lensing is 
to reconstruct the (projected) mass distribution of galaxy clusters from their mea- 
surable tidal fields. Consequently, the corresponding theory constitutes one of the 
largest sections of this review. 

Such strong and weak gravitational lens effects offer the possibility to detect and 
measure the entire cluster mass, dark and luminous, without referring to any equi- 
librium or symmetry assumptions like those required for the mass estimates from 
galactic kinematics or X-ray emission. For a review on arcs and arclets in galaxy 
clusters see Fort & Mellier (1994).

Apart from being spectacular objects in their own right, clusters are also of par- 
ticular interest for cosmology. Being the largest gravitationally bound entities in 
the cosmos, they represent the high-mass end of collapsed structures. Their num-
ber density, their individual properties, and their spatial distribution constrain the 
power spectrum of the density fluctuations from which the structure in the uni- 
verse is believed to have originated (e.g. Viana & Liddle 1996; Eke et al. 1996). 
Their formation history is sensitive to the parameters that determine the geometry 
of the universe as a whole. If the matter density in the universe is high, clusters 
tend to form later in cosmic history than if the matter density is low (first noted by 
Richstone et al. 1992). This is due to the behaviour of the growth factor shown in 
Fig. 6, combined with the Gaussian nature of the initial density fluctuations. Conse- 
quently, the compactness and the morphology of clusters reflect the cosmic matter 
density, and this has various observable implications. One method to normalise the
density-perturbation power spectrum fixes its overall amplitude such that the local 
spatial number density of galaxy clusters is reproduced. This method, called cluster 
normalisation and pioneered by White et al. (1993), will frequently be used in this 
review. 

In summary, clusters are not only regions of higher galaxy number density in 
the sky, but they are gravitationally bound bodies whose member galaxies contribute
only a small fraction of their mass. About 80% of their mass is dark, and
roughly 10% is in the form of the diffuse, X-ray emitting gas spread throughout
the cluster. Mass estimates inferred from galaxy kinematics, X-ray emission, and
gravitational-lensing effects generally agree to within about a factor of two, typically
arriving at masses of order $5\times10^{14}$ solar masses, or $10^{48}$ g. Typical sizes of
galaxy clusters are of order several megaparsecs, or $5\times10^{24}$ cm. In addition, there
are smaller objects, called galaxy groups, which contain fewer galaxies and have
typical masses of order $10^{13}$ solar masses.



2.3.3 Active Galactic Nuclei 

The term 'active galactic nuclei' (AGNs) is applied to galaxies which show signs of 
non-stellar radiation in their centres. Whereas the emission from 'normal' galaxies 
like our own is completely dominated by radiation from stars and their remnants, 
the emission from AGNs is a combination of stellar light and non-thermal emission 
from their nuclei. In fact, the most prominent class of AGNs, the quasi-stellar radio 
sources, or quasars, have their names derived from the fact that their optical appear- 
ance is point-like. The nuclear emission almost completely outshines the extended 
stellar light of its host galaxy. 

AGNs do not form a homogeneous class of objects. Instead, they are grouped into
several types. The main classes are: quasars, quasi-stellar objects (QSOs), Seyfert
galaxies, BL Lacertae objects (BL Lacs), and radio galaxies. What unifies them 
is the non-thermal emission from their nucleus, which manifests itself in various 
ways: (1) radio emission which, owing to its spectrum and polarisation, is inter- 
preted as synchrotron radiation from a power-law distribution of relativistic elec- 
trons; (2) strong ultraviolet and optical emission lines from highly ionised species, 
which in some cases can be extremely broad, corresponding to Doppler velocities 
up to $\sim 20{,}000\,\mathrm{km\,s^{-1}}$, thus indicating the presence of semi-relativistic velocities in
the emission region; (3) a flat ultraviolet-to-optical continuum spectrum, often ac- 
companied by polarisation of the optical light, which cannot naturally be explained 
by a superposition of stellar (Planck) spectra; (4) strong X-ray emission with a 
hard power-law spectrum, which can be interpreted as inverse Compton radiation 
by a population of relativistic electrons with a power-law energy distribution; (5) 
strong gamma-ray emission; (6) variability at all wavelengths, from the radio to 
the gamma-ray regime. Not all these phenomena occur at the same level in all 
the classes of AGNs. QSOs, for example, can roughly be grouped into radio-quiet
QSOs and quasars, the latter emitting strongly at radio wavelengths. 

Since substantial variability cannot occur on timescales shorter than the light-travel 
time across the emitting region, the variability provides a rigorous constraint on
the compactness of the region emitting the bulk of the nuclear radiation. In fact, this 
causality argument based on light-travel time can mildly be violated if relativistic 
velocities are present in the emitting region. Direct evidence for this comes from the 
observation of the so-called superluminal motion, where radio-source components 
exhibit apparent velocities in excess of c (e.g. Zensus & Pearson 1987). This can 
be understood as a projection effect, combining velocities close to (but of course 
smaller than) the velocity of light with a velocity direction close to the line-of-sight 
to the observer. Observations of superluminal motion indicate that bulk velocities 
of the radio-emitting plasma components can have Lorentz factors of order 10, i.e., 
they move at ~ 0.99 c. 
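
The projection effect can be made explicit with the standard expression for the apparent transverse velocity of a component moving with speed $\beta c$ at an angle $\vartheta$ to the line-of-sight, $\beta_\mathrm{app} = \beta\sin\vartheta/(1-\beta\cos\vartheta)$. This formula is not given in the text above; the sketch below evaluates it for a Lorentz factor of 10, with function names of our own choosing.

```python
import math

def beta_from_lorentz(gamma):
    """Speed in units of c for a given Lorentz factor."""
    return math.sqrt(1.0 - 1.0 / gamma**2)

def apparent_beta(beta, theta):
    """Apparent transverse speed (units of c) of a component moving at speed
    beta at angle theta (radians) to the line-of-sight."""
    return beta * math.sin(theta) / (1.0 - beta * math.cos(theta))

gamma = 10.0
beta = beta_from_lorentz(gamma)
theta_max = math.acos(beta)  # viewing angle that maximises the apparent speed
print(beta, apparent_beta(beta, theta_max), math.degrees(theta_max))
```

The apparent speed peaks at $\cos\vartheta = \beta$, where it equals $\beta\gamma$, so a Lorentz factor of order 10 yields apparent velocities of nearly $10\,c$ even though the true speed stays below $c$.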

The standard picture for the origin of this nuclear activity is that a supermassive 
black hole (of order $10^8\,M_\odot$), situated in the centre of the host galaxy, accretes
gas from the host. In this process, gravitational binding energy is released, part of 
which can be transformed into radiation. The appearance of an AGN then depends 
on the black-hole mass and angular momentum, the accretion rate, the efficiency of 
the transformation of binding energy into radiation, and on the orientation relative 
to the line-of-sight. The understanding of the physical mechanisms in AGNs, and 
how they are related to their phenomenology, is still rather incomplete. We refer 
the reader to the books and articles by Begelman et al. (1984), Weedman (1986), 
Blandford et al. (1990), Peterson (1997), and Krolik (1999), and references therein, 
for an overview of the phenomena in AGNs, and of our current ideas on their in- 
terpretation. For the current review, we only make use of one particular property of 
AGNs: 

QSOs can be extremely luminous. Their optical luminosity can exceed that of normal
galaxies by a factor of a thousand or more. Therefore, their nuclear
activity completely outshines that of the host galaxy, and the nuclear sources appear 
point-like on optical images. Furthermore, the high luminosity implies that QSOs 
can be seen to very large distances, and in fact, until a few years ago QSOs held the 
redshift record. In addition, the comoving number density of QSOs evolves rapidly 
with redshift. It was larger than today by a factor of ~ 100 at redshifts between 2 
and 3. Taken together, these two facts imply that a flux-limited sample of QSOs has 
a very broad redshift distribution, in particular, very distant objects are abundant in 
such a sample. 

However, it is quite difficult to obtain a 'complete' flux-limited sample of QSOs. 
Of all point-like objects at optical wavelengths, QSOs constitute only a tiny frac- 
tion, most being stars. Hence, morphology alone does not suffice to obtain a can- 
didate QSO sample which can be verified spectroscopically. However, QSOs are 
found to have very blue optical colours, by which they can efficiently be selected.
Colour selection typically yields equal numbers of white dwarfs and QSOs with
redshifts below $\sim 2.3$. For higher-redshift QSOs, the strong Ly$\alpha$ emission line
moves from the U-band filter into the B-band, yielding redder $U-B$ colours. For
these higher-redshift QSOs, multi-colour or emission-line selection criteria must 
be used (cf. Fan et al. 1999). In contrast to optical selection, AGNs are quite ef- 
ficiently selected in radio surveys. The majority of sources selected at centimeter 
wavelengths are AGNs. A flux-limited sample of radio-selected AGNs also has a
very broad redshift distribution. The large fraction of distant objects in these samples
makes AGNs particularly promising sources for the gravitational lensing effect,
as the probability of finding an intervening mass concentration close to the line-of- 
sight increases with the source distance. In fact, most of the known multiple-image 
gravitational lens systems have AGN sources. 

In addition to their high redshifts, the number counts of AGNs are important for 
lensing. For bright QSOs with apparent B-band magnitudes B < 19, the differential 
source counts can be approximated by a power law, $n(S) \propto S^{-(\alpha+1)}$, where $n(S)\,dS$
is the number density of QSOs per unit solid angle with flux within $dS$ of $S$, and $\alpha \approx 2.6$.
At fainter magnitudes, the differential source counts can also be approximated
by a power law in flux, but with a much flatter index of $\alpha \approx 0.5$. The source counts
at radio wavelengths are also quite steep for the highest fluxes, and flatten as the 
flux decreases. The steepness of the source counts will be the decisive property of 
AGNs for the magnification bias, which will be discussed in Sect. 6. 

2.4 Correlation Functions, Power Spectra, and their Projections 

2.4.1 Definitions; Homogeneous and Isotropic Random Fields

In this subsection, we define the correlation function and the power spectrum of a 
random field, which will be used extensively in later sections. One example already 
occurred above, namely the power spectrum $P_\delta$ of the density fluctuation field $\delta$.

Consider a random field $g(\vec x)$ whose expectation value is zero everywhere. This
means that an average over many realisations of the random field should vanish,
$\langle g(\vec x)\rangle = 0$, for all $\vec x$. This is not an important restriction, for if that was not the case,
we could consider the field $g(\vec x) - \langle g(\vec x)\rangle$ instead, which would have the desired
property. Spatial positions $\vec x$ have $n$ dimensions, and the field can be either real or
complex.

A random field $g(\vec x)$ is called homogeneous if it cannot statistically be distinguished
from the field $g(\vec x + \vec y)$, where $\vec y$ is an arbitrary translation vector. Similarly, a random
field $g(\vec x)$ is called isotropic if it has the same statistical properties as the
random field $g(\mathcal{R}\vec x)$, where $\mathcal{R}$ is an arbitrary rotation matrix in $n$ dimensions.
Restricting our attention to homogeneous and isotropic random fields, we note that
the two-point correlation function

$$\langle g(\vec x)\,g^*(\vec y)\rangle = C_{gg}(|\vec x - \vec y|) \tag{2.72}$$

can only depend on the absolute value of the difference vector between the two
points $\vec x$ and $\vec y$. Note that $C_{gg}$ is real, even if $g$ is complex. This can be seen by
taking the complex conjugate of (2.72), which is equivalent to interchanging $\vec x$ and
$\vec y$, leaving the right-hand-side unaffected.



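We define the Fourier-transform pair of $g$ as

$$\hat g(\vec k) = \int_{\mathbb{R}^n} d^n x\; g(\vec x)\,e^{i\vec x\cdot\vec k}\;;\quad g(\vec x) = \int_{\mathbb{R}^n} \frac{d^n k}{(2\pi)^n}\;\hat g(\vec k)\,e^{-i\vec x\cdot\vec k}\;. \tag{2.73}$$

We now calculate the correlation function in Fourier space,

$$\langle \hat g(\vec k)\,\hat g^*(\vec k')\rangle = \int_{\mathbb{R}^n} d^n x\; e^{i\vec x\cdot\vec k} \int_{\mathbb{R}^n} d^n x'\; e^{-i\vec x'\cdot\vec k'}\,\langle g(\vec x)\,g^*(\vec x')\rangle\;. \tag{2.74}$$

Using (2.72) and substituting $\vec x' = \vec x + \vec y$, this becomes

$$\begin{aligned}
\langle \hat g(\vec k)\,\hat g^*(\vec k')\rangle &= \int_{\mathbb{R}^n} d^n x\; e^{i\vec x\cdot\vec k} \int_{\mathbb{R}^n} d^n y\; e^{-i(\vec x+\vec y)\cdot\vec k'}\,C_{gg}(|\vec y|) \\
&= (2\pi)^n\,\delta_\mathrm{D}(\vec k - \vec k') \int_{\mathbb{R}^n} d^n y\; e^{-i\vec y\cdot\vec k}\,C_{gg}(|\vec y|) \\
&= (2\pi)^n\,\delta_\mathrm{D}(\vec k - \vec k')\,P_g(|\vec k|)\;.
\end{aligned} \tag{2.75}$$

In the final step, we defined the power spectrum of the homogeneous and isotropic
random field $g$,

$$P_g(|\vec k|) = \int_{\mathbb{R}^n} d^n y\; e^{-i\vec y\cdot\vec k}\,C_{gg}(|\vec y|)\;, \tag{2.76}$$

which is the Fourier transform of the two-point correlation function. Isotropy of the
random field implies that $P_g$ can only depend on the modulus of $\vec k$.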

Gaussian random fields are characterised by the property that the probability distribution
of any linear combination of the random field $g(\vec x)$ is Gaussian. More generally,
the joint probability distribution of a number $M$ of linear combinations of
the random variable $g(\vec x_i)$ is a multivariate Gaussian. This is equivalent to requiring
that the Fourier components $\hat g(\vec k)$ are mutually statistically independent, and that
the probability densities for the $\hat g(\vec k)$ are Gaussian with dispersion $P_g(|\vec k|)$. Thus, a
Gaussian random field is fully characterised by its power spectrum.
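
The statement that a Gaussian random field is fully specified by its power spectrum suggests a simple way of generating realisations. The sketch below does this for a one-dimensional periodic field with NumPy. It assumes that on a box of length $L$ the continuum relation $\langle|\hat g(k)|^2\rangle \propto \delta_\mathrm{D}$ is replaced by $\langle|\hat g(k)|^2\rangle = L\,P(k)$ for discrete modes; the toy power spectrum and all names are hypothetical.

```python
import numpy as np

def gaussian_field_1d(power, n_grid, box_length, rng):
    """One realisation of a real, zero-mean, periodic Gaussian random field
    whose Fourier modes are independent with variance set by power(|k|)."""
    k = 2.0 * np.pi * np.fft.rfftfreq(n_grid, d=box_length / n_grid)
    amplitude = np.sqrt(n_grid**2 * power(k) / box_length)
    modes = amplitude * (rng.standard_normal(k.size)
                         + 1j * rng.standard_normal(k.size)) / np.sqrt(2.0)
    modes[0] = 0.0  # enforce a vanishing mean
    return k, np.fft.irfft(modes, n=n_grid)

def estimate_power(field, box_length):
    """Periodogram estimate |g_hat|^2 / L with g_hat = dx * FFT(g).
    NumPy's FFT sign convention differs from (2.73); |g_hat|^2 is unaffected."""
    dx = box_length / field.size
    return np.abs(dx * np.fft.rfft(field)) ** 2 / box_length

def toy_power(k, k_c=2.0):
    """Hypothetical power spectrum, chosen only for illustration."""
    return k * np.exp(-(k / k_c) ** 2)

rng = np.random.default_rng(0)
n_grid, box_length, n_real = 1024, 100.0, 400
fields, spectra = [], []
for _ in range(n_real):
    k, g = gaussian_field_1d(toy_power, n_grid, box_length, rng)
    fields.append(g)
    spectra.append(estimate_power(g, box_length))
mean_power = np.mean(spectra, axis=0)
# C_gg(0) = int dk/(2 pi) P(|k|), approximated by a sum over +k and -k modes.
variance_expected = 2.0 * np.sum(toy_power(k[1:])) / box_length
variance_measured = np.var(np.concatenate(fields))
print(variance_measured, variance_expected)
```

Averaged over realisations, the measured spectrum should recover the input spectrum, and the field variance should match the correlation function at zero lag, i.e. the integral of the power spectrum over $dk/2\pi$.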
2.4.2 Projections; Limber's Equation



We now derive a relation between the power spectrum (or the correlation function)
of a homogeneous isotropic random field in three dimensions, and its projection
onto two dimensions. Specifically, for the three-dimensional field, we consider the
density contrast $\delta[f_K(w)\vec\theta, w]$, where $\vec\theta$ is a two-dimensional vector, which could
be an angular position on the sky. Hence, $f_K(w)\vec\theta$ and $w$ form a local comoving
isotropic Cartesian coordinate system. We define two different projections of
$\delta$ along the backward-directed light cone of the observer at $w = 0$, $t = t_0$,

$$g_i(\vec\theta) = \int dw\; q_i(w)\,\delta[f_K(w)\vec\theta, w]\;, \tag{2.77}$$

for $i = 1,2$. The $q_i(w)$ are weight functions, and the integral extends from $w = 0$ to
the horizon $w = w_\mathrm{H}$. Since $\delta$ is a homogeneous and isotropic random field, so is its
projection. Consider now the correlation function

$$\begin{aligned}
C_{12} &= \langle g_1(\vec\theta)\,g_2(\vec\theta')\rangle \\
&= \int dw\; q_1(w) \int dw'\; q_2(w')\,\langle \delta[f_K(w)\vec\theta, w]\;\delta[f_K(w')\vec\theta', w']\rangle\;.
\end{aligned} \tag{2.78}$$

We assume that there is no power in the density fluctuations on scales larger than a
coherence scale $L_\mathrm{coh}$. This is justified because the power spectrum $P_\delta$ declines $\propto k$
as $k \to 0$; see (2.62). This implies that the correlation function on the right-hand
side of eq. (2.78) vanishes for $w_\mathrm{H} \gg |w - w'| \gtrsim L_\mathrm{coh}$. Although $\delta$ evolves cosmologically,
it can be considered constant over a time scale on which light travels
across a comoving distance $L_\mathrm{coh}$. We note that the second argument of $\delta$ simultaneously
denotes the third local spatial dimension and the cosmological epoch,
related through the light-cone condition $|c\,dt| = a\,dw$. Furthermore, we assume that
the weight functions $q_i(w)$ do not vary appreciably over a scale $\Delta w \le L_\mathrm{coh}$. Consequently,
$|w - w'| \lesssim L_\mathrm{coh}$ over the scale where $C_{\delta\delta}$ is non-zero, and we can set
$f_K(w') \approx f_K(w)$ and $q_2(w') = q_2(w)$ to obtain

$$C_{12}(\theta) = \int dw\; q_1(w)\,q_2(w) \int d(\Delta w)\; C_{\delta\delta}\left(\sqrt{f_K^2(w)\,\theta^2 + (\Delta w)^2},\, w\right)\;. \tag{2.79}$$

The second argument of $C_{\delta\delta}$ now denotes the dependence of the correlation function
on cosmic epoch. Equation (2.79) is one form of Limber's (1953) equation,
which relates the two-point correlation of the projected field to that of the
three-dimensional field.

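Another very useful form of this equation relates the projected two-point correlation
function to the power spectrum of the three-dimensional field. The easiest way
to derive this relation is by replacing the $\delta$'s in (2.78) by their Fourier transforms,
whereupon

$$\begin{aligned}
C_{12} = {} & \int dw\; q_1(w) \int dw'\; q_2(w') \int \frac{d^3k}{(2\pi)^3} \int \frac{d^3k'}{(2\pi)^3} \\
& \times\, \langle \hat\delta(\vec k, w)\,\hat\delta^*(\vec k', w')\rangle\; e^{-i f_K(w)\,\vec k_\perp\cdot\vec\theta}\; e^{i f_K(w')\,\vec k'_\perp\cdot\vec\theta'}\; e^{-i k_3 w}\; e^{i k'_3 w'}\;.
\end{aligned} \tag{2.80}$$

$\vec k_\perp$ is the two-dimensional wave vector perpendicular to the line-of-sight. The correlator
can be replaced by the power spectrum $P_\delta$ using (2.75). This introduces a
Dirac delta function $\delta_\mathrm{D}(\vec k - \vec k')$, which allows us to carry out the $\vec k'$-integration. Under
the same assumptions on the spatial variation of $q_i(w)$ and $f_K(w)$ as before, we
find

$$C_{12} = \int dw\; q_1(w)\,q_2(w) \int \frac{d^3k}{(2\pi)^3}\; P_\delta(|\vec k|, w)\; e^{-i f_K(w)\,\vec k_\perp\cdot(\vec\theta - \vec\theta')}\; e^{-i k_3 w} \int dw'\; e^{i k_3 w'}\;. \tag{2.81}$$

The final integral yields $2\pi\,\delta_\mathrm{D}(k_3)$, indicating that only such modes contribute to
the projected correlation function whose wave-vectors lie in the plane of the sky
(Blandford et al. 1991). Finally, carrying out the trivial $k_3$-integration yields

$$C_{12}(\theta) = \int dw\; q_1(w)\,q_2(w) \int \frac{d^2k_\perp}{(2\pi)^2}\; P_\delta(|\vec k_\perp|, w)\; e^{-i f_K(w)\,\vec k_\perp\cdot\vec\theta} \tag{2.82}$$

$$\phantom{C_{12}(\theta)} = \int dw\; q_1(w)\,q_2(w) \int \frac{k\,dk}{2\pi}\; P_\delta(k, w)\; J_0[f_K(w)\,\theta\,k]\;. \tag{2.83}$$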

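The definition (2.73) of the Fourier transform, and the relation (2.76) between
power spectrum and correlation function allow us to write the (cross) power spectrum
$P_{12}(l)$ as

$$\begin{aligned}
P_{12}(l) &= \int d^2\theta\; C_{12}(\theta)\; e^{i\vec l\cdot\vec\theta} \\
&= \int dw\; q_1(w)\,q_2(w) \int \frac{d^2k_\perp}{(2\pi)^2}\; P_\delta(|\vec k_\perp|, w)\;(2\pi)^2\,\delta_\mathrm{D}[\vec l - f_K(w)\,\vec k_\perp] \\
&= \int dw\; \frac{q_1(w)\,q_2(w)}{f_K^2(w)}\; P_\delta\left(\frac{l}{f_K(w)},\, w\right)\;,
\end{aligned} \tag{2.84}$$

which is Limber's equation in Fourier space (Kaiser 1992, 1998). We shall make
extensive use of these relations in later sections.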
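
Limber's equation in Fourier space reduces the projected power spectrum to a single line-of-sight integral, $P_{12}(l) = \int dw\, q_1(w)\,q_2(w)\,f_K^{-2}(w)\,P_\delta(l/f_K(w), w)$, which is straightforward to evaluate numerically. The following sketch does so for a toy configuration: flat space with $f_K(w) = w$, a non-evolving power law $P_\delta \propto k^{-1}$, and a top-hat weight function. All of these choices, and the function names, are assumptions made purely so that the result can be compared with a closed-form integral.

```python
import math

def limber_power(ell, q1, q2, f_k, power, w_min, w_max, n_steps=2000):
    """Fourier-space Limber integral
    P_12(l) = int dw q1(w) q2(w) / f_K(w)^2 * P_delta(l / f_K(w), w),
    evaluated with the composite trapezoidal rule."""
    def integrand(w):
        return q1(w) * q2(w) / f_k(w) ** 2 * power(ell / f_k(w), w)
    h = (w_max - w_min) / n_steps
    total = 0.5 * (integrand(w_min) + integrand(w_max))
    for i in range(1, n_steps):
        total += integrand(w_min + i * h)
    return total * h

# Toy setup; every choice below is an illustrative assumption.
W1, W2, AMP = 500.0, 1500.0, 1.0e4

def top_hat(w):
    """Weight function normalised to unit integral on [W1, W2]."""
    return 1.0 / (W2 - W1)

def flat_f_k(w):
    """Comoving angular-diameter distance in a spatially flat model."""
    return w

def power_law(k, w):
    """Non-evolving power spectrum P(k) = AMP / k."""
    return AMP / k

ell = 1000.0
numeric = limber_power(ell, top_hat, top_hat, flat_f_k, power_law, W1, W2)
analytic = AMP * math.log(W2 / W1) / (ell * (W2 - W1) ** 2)
print(numeric, analytic)
```

For this toy case the integrand reduces to $A\,q^2/(l\,w)$, so the numerical result can be checked against the logarithmic closed form computed in `analytic`.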
3 Gravitational Light Deflection



In this section, we summarise the theoretical basis for the description of light deflection
by gravitational fields. Granted the validity of Einstein's Theory of General
Relativity, light propagates on the null geodesics of the space-time metric. However,
most astrophysically relevant situations permit a much simpler approximate
description of light rays, which is called gravitational lens theory; we first describe 
this theory in Sect. 3.1. It is sufficient for the treatment of lensing by galaxy clus- 
ters in Sect. 5, where the deflecting mass is localised in a region small compared 
to the distance between source and deflector, and between deflector and observer. 
In contrast, mass distributions on a cosmic scale cause small light deflections all 
along the path from the source to the observer. The magnification and shear effects 
resulting therefrom require a more general description, which we shall develop in 
Sect. 3.2. In particular, we outline how the gravitational lens approximation derives 
from this more general description. 



3.1 Gravitational Lens Theory 




Fig. 11. Sketch of a typical gravitational lens system. 
A typical situation considered in gravitational lensing is sketched in Fig. 11, where
a mass concentration at redshift $z_\mathrm{d}$ (or angular diameter distance $D_\mathrm{d}$) deflects the
light rays from a source at redshift $z_\mathrm{s}$ (or angular diameter distance $D_\mathrm{s}$). If there
are no other deflectors close to the line-of-sight, and if the extent of the deflecting
mass along the line-of-sight is very much smaller than both $D_\mathrm{d}$ and the angular
diameter distance $D_\mathrm{ds}$ from the deflector to the source,^ the actual light rays which
are smoothly curved in the neighbourhood of the deflector can be replaced by two
straight rays with a kink near the deflector. The magnitude and direction of this
kink is described by the deflection angle $\hat{\vec\alpha}$, which depends on the mass distribution
of the deflector and the impact vector of the light ray.



3.1.1 The Deflection Angle 

Consider first the deflection by a point mass $M$. If the light ray does not propagate
through the strong gravitational field close to the horizon, that is, if its
impact parameter $\xi$ is much larger than the Schwarzschild radius of the lens,
$\xi \gg R_\mathrm{S} \equiv 2GM\,c^{-2}$, then General Relativity predicts that the deflection angle $\hat\alpha$
is

$$\hat\alpha = \frac{4GM}{c^2\,\xi}\;. \tag{3.1}$$

This is just twice the value obtained in Newtonian gravity (see the historical remarks
in Schneider et al. 1992). According to the condition $\xi \gg R_\mathrm{S}$, the deflection
angle is small, $\hat\alpha \ll 1$.
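
As a worked example, the sketch below evaluates the point-mass deflection angle $\hat\alpha = 4GM/(c^2\xi)$ for a light ray grazing the solar limb and checks that the impact parameter is indeed much larger than the Schwarzschild radius. The numerical constants are standard approximate values in cgs units, and the function names are our own.

```python
import math

G = 6.674e-8       # gravitational constant [cm^3 g^-1 s^-2]
C = 2.998e10       # speed of light [cm s^-1]
M_SUN = 1.989e33   # solar mass [g]
R_SUN = 6.957e10   # solar radius [cm]

def schwarzschild_radius(mass):
    """R_S = 2 G M / c^2."""
    return 2.0 * G * mass / C**2

def point_mass_deflection(mass, impact_parameter):
    """alpha_hat = 4 G M / (c^2 xi) in radians; valid only for xi >> R_S."""
    return 4.0 * G * mass / (C**2 * impact_parameter)

alpha_rad = point_mass_deflection(M_SUN, R_SUN)
alpha_arcsec = math.degrees(alpha_rad) * 3600.0
print(schwarzschild_radius(M_SUN) / R_SUN, alpha_arcsec)
```

The result reproduces the deflection of about 1.75 arc seconds quoted in the Introduction, with $R_\mathrm{S}/\xi$ of order $10^{-6}$, so the weak-field condition is very well satisfied.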

^ This condition is very well satisfied in most astrophysical situations. A cluster of galaxies,
for instance, has a typical size of a few Mpc, whereas the distances $D_\mathrm{d}$, $D_\mathrm{s}$, and $D_\mathrm{ds}$ are
fair fractions of the Hubble length $cH_0^{-1} = 3\,h^{-1}\times10^3$ Mpc.

The field equations of General Relativity can be linearised if the gravitational field
is weak. The deflection angle of an ensemble of point masses is then the (vectorial)
sum of the deflections due to individual lenses. Consider now a three-dimensional
mass distribution with volume density $\rho(\vec r)$. We can divide it into cells of size $dV$
and mass $dm = \rho(\vec r)\,dV$. Let a light ray pass this mass distribution, and describe
its spatial trajectory by $(\xi_1(\lambda), \xi_2(\lambda), r_3(\lambda))$, where the coordinates are chosen such
that the incoming light ray (i.e. far from the deflecting mass distribution) propagates
along $r_3$. The actual light ray is deflected, but if the deflection angle is small, it can
be approximated as a straight line in the neighbourhood of the deflecting mass.
This corresponds to the Born approximation in atomic and nuclear physics. Then,
$\vec\xi(\lambda) \equiv \vec\xi$, independent of the affine parameter $\lambda$. Note that $\vec\xi = (\xi_1, \xi_2)$ is a
two-dimensional vector. The impact vector of the light ray relative to the mass element
$dm$ at $\vec r = (\xi_1', \xi_2', r_3')$ is then $\vec\xi - \vec\xi'$, independent of $r_3'$, and the total deflection angle
is

$$\begin{aligned}
\hat{\vec\alpha}(\vec\xi) &= \frac{4G}{c^2} \sum dm(\xi_1', \xi_2', r_3')\,\frac{\vec\xi - \vec\xi'}{|\vec\xi - \vec\xi'|^2} \\
&= \frac{4G}{c^2} \int d^2\xi' \int dr_3'\; \rho(\xi_1', \xi_2', r_3')\,\frac{\vec\xi - \vec\xi'}{|\vec\xi - \vec\xi'|^2}\;,
\end{aligned} \tag{3.2}$$

which is also a two-dimensional vector. Since the last factor in eq. (3.2) is independent
of $r_3'$, the $r_3'$-integration can be carried out by defining the surface mass
density

$$\Sigma(\vec\xi) \equiv \int dr_3\; \rho(\xi_1, \xi_2, r_3)\;, \tag{3.3}$$

which is the mass density projected onto a plane perpendicular to the incoming
light ray. Then, the deflection angle finally becomes

$$\hat{\vec\alpha}(\vec\xi) = \frac{4G}{c^2} \int d^2\xi'\; \Sigma(\vec\xi')\,\frac{\vec\xi - \vec\xi'}{|\vec\xi - \vec\xi'|^2}\;. \tag{3.4}$$

This expression is valid as long as the deviation of the actual light ray from a 
straight (undeflected) line within the mass distribution is small compared to the 
scale on which the mass distribution changes significantly. This condition is satis- 
fied in virtually all astrophysically relevant situations (i.e. lensing by galaxies and 
clusters of galaxies), unless the deflecting mass extends all the way from the source 
to the observer (a case which will be dealt with in Sect. 6). It should also be noted 
that in a lensing situation such as displayed in Fig. 11, the incoming light rays 
are not mutually parallel, but fall within a beam with opening angle approximately 
equal to the angle which the mass distribution subtends on the sky. This angle, 
however, is typically very small (in the case of cluster lensing, the relevant angular 
scales are of order 1 arc min $\approx 2.9\times10^{-4}$).



3.1.2 The Lens Equation 

We now require an equation which relates the true position of the source to its 
observed position on the sky. As sketched in Fig. 11, the source and lens planes are 
defined as planes perpendicular to a straight line (the optical axis) from the observer 
to the lens at the distance of the source and of the lens, respectively. The exact 
definition of the optical axis does not matter because of the smallness of angles 
involved in a typical lens situation, and the distance to the lens is well defined for a 
geometrically-thin matter distribution. Let $\vec\eta$ denote the two-dimensional position
of the source on the source plane. Recalling the definition of the angular-diameter
distance, we can read off Fig. 11

$$\vec\eta = \frac{D_\mathrm{s}}{D_\mathrm{d}}\,\vec\xi - D_\mathrm{ds}\,\hat{\vec\alpha}(\vec\xi)\;. \tag{3.5}$$
Introducing angular coordinates by $\vec\eta = D_\mathrm{s}\vec\beta$ and $\vec\xi = D_\mathrm{d}\vec\theta$, we can transform
eq. (3.5) to

$$\vec\beta = \vec\theta - \frac{D_\mathrm{ds}}{D_\mathrm{s}}\,\hat{\vec\alpha}(D_\mathrm{d}\vec\theta) \equiv \vec\theta - \vec\alpha(\vec\theta)\;, \tag{3.6}$$

where we defined the scaled deflection angle $\vec\alpha(\vec\theta)$ in the last step. The interpretation
of the lens equation (3.6) is that a source with true position $\vec\beta$ can be seen by an
observer at angular positions $\vec\theta$ satisfying (3.6). If (3.6) has more than one solution
for fixed $\vec\beta$, a source at $\vec\beta$ has images at several positions on the sky, i.e. the lens
produces multiple images. For this to happen, the lens must be 'strong'. This can
be quantified by the dimension-less surface mass density

$$\kappa(\vec\theta) = \frac{\Sigma(D_\mathrm{d}\vec\theta)}{\Sigma_\mathrm{cr}} \quad\text{with}\quad \Sigma_\mathrm{cr} = \frac{c^2}{4\pi G}\,\frac{D_\mathrm{s}}{D_\mathrm{d}\,D_\mathrm{ds}}\;, \tag{3.7}$$



where $\Sigma_\mathrm{cr}$ is called the critical surface mass density (which depends on the redshifts
of source and lens). A mass distribution which has $\kappa > 1$ somewhere, i.e. $\Sigma > \Sigma_\mathrm{cr}$,
produces multiple images for some source positions $\vec\beta$ (see Schneider et al. 1992,
Sect. 5.4.3). Hence, $\Sigma_\mathrm{cr}$ is a characteristic value for the surface mass density which
distinguishes between 'weak' and 'strong' lenses. Note that $\kappa > 1$ is sufficient but
not necessary for producing multiple images. In terms of $\kappa$, the scaled deflection
angle reads

$$\vec\alpha(\vec\theta) = \frac{1}{\pi} \int_{\mathbb{R}^2} d^2\theta'\; \kappa(\vec\theta')\,\frac{\vec\theta - \vec\theta'}{|\vec\theta - \vec\theta'|^2}\;. \tag{3.8}$$

Equation (3.8) implies that the deflection angle can be written as the gradient of the
deflection potential,

$$\psi(\vec\theta) = \frac{1}{\pi} \int_{\mathbb{R}^2} d^2\theta'\; \kappa(\vec\theta')\,\ln|\vec\theta - \vec\theta'|\;, \tag{3.9}$$

as $\vec\alpha = \nabla\psi$. The potential $\psi(\vec\theta)$ is the two-dimensional analogue of the Newtonian
gravitational potential and satisfies the Poisson equation $\nabla^2\psi(\vec\theta) = 2\kappa(\vec\theta)$.
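
To give a feeling for the numbers involved, the sketch below evaluates the critical surface mass density $\Sigma_\mathrm{cr} = \frac{c^2}{4\pi G}\frac{D_\mathrm{s}}{D_\mathrm{d}D_\mathrm{ds}}$ and the corresponding convergence $\kappa = \Sigma/\Sigma_\mathrm{cr}$. The three angular-diameter distances are hypothetical values of order a Gpc, not derived from any particular cosmological model.

```python
import math

G = 6.674e-8     # gravitational constant [cm^3 g^-1 s^-2]
C = 2.998e10     # speed of light [cm s^-1]
MPC = 3.086e24   # megaparsec [cm]

def critical_surface_density(d_d, d_s, d_ds):
    """Sigma_cr = c^2 / (4 pi G) * D_s / (D_d D_ds); distances in cm, result in g cm^-2."""
    return C**2 / (4.0 * math.pi * G) * d_s / (d_d * d_ds)

def convergence(surface_density, d_d, d_s, d_ds):
    """Dimensionless surface mass density kappa = Sigma / Sigma_cr."""
    return surface_density / critical_surface_density(d_d, d_s, d_ds)

# Hypothetical angular-diameter distances, for illustration only.
# Note that D_ds is not simply D_s - D_d for angular-diameter distances.
d_d, d_s, d_ds = 1000.0 * MPC, 1700.0 * MPC, 1100.0 * MPC
sigma_cr = critical_surface_density(d_d, d_s, d_ds)
print(sigma_cr, convergence(0.1, d_d, d_s, d_ds))
```

For distances of this order the critical surface mass density is a fraction of a gram per square centimetre, so a lens must reach a projected density of that magnitude somewhere for $\kappa$ to exceed unity.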



3.1.3 Magnification and Distortion

The solutions $\vec\theta$ of the lens equation yield the angular positions of the images of
a source at $\vec\beta$. The shapes of the images will differ from the shape of the source
because light bundles are deflected differentially. The most visible consequence of
this distortion is the occurrence of giant luminous arcs in galaxy clusters. In general,
the shape of the images must be determined by solving the lens equation for
all points within an extended source. Liouville's theorem and the absence of emission
and absorption of photons in gravitational light deflection imply that lensing
conserves surface brightness (or specific intensity). Hence, if $I^{(s)}(\vec\beta)$ is the surface
brightness distribution in the source plane, the observed surface brightness distribution
in the lens plane is

$$I(\vec\theta) = I^{(s)}[\vec\beta(\vec\theta)]\;. \tag{3.10}$$

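If a source is much smaller than the angular scale on which the lens properties
change, the lens mapping can locally be linearised. The distortion of images is then
described by the Jacobian matrix

$$\mathcal{A}(\vec\theta) = \frac{\partial\vec\beta}{\partial\vec\theta} = \left(\delta_{ij} - \frac{\partial^2\psi(\vec\theta)}{\partial\theta_i\,\partial\theta_j}\right) = \begin{pmatrix} 1-\kappa-\gamma_1 & -\gamma_2 \\ -\gamma_2 & 1-\kappa+\gamma_1 \end{pmatrix} \;, \tag{3.11}$$

where we have introduced the components of the shear $\gamma \equiv \gamma_1 + i\gamma_2 = |\gamma|\,e^{2i\varphi}$,

$$\gamma_1 = \frac{1}{2}\left(\psi_{,11}-\psi_{,22}\right) \;,\quad \gamma_2 = \psi_{,12} \;, \tag{3.12}$$

and $\kappa$ is related to $\psi$ through Poisson's equation. Hence, if $\vec\theta_0$ is a point within an
image, corresponding to the point $\vec\beta_0 = \vec\beta(\vec\theta_0)$ within the source, we find from (3.10)
using the locally linearised lens equation

$$I(\vec\theta) = I^{(s)}\left[\vec\beta_0 + \mathcal{A}(\vec\theta_0)\cdot(\vec\theta-\vec\theta_0)\right] \;. \tag{3.13}$$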



According to this equation, the images of a circular source are ellipses. The ratios of
the semi-axes of such an ellipse to the radius of the source are given by the inverse
of the eigenvalues of $\mathcal{A}(\vec\theta_0)$, which are $1-\kappa\pm|\gamma|$, and the ratio of the solid angles
subtended by an image and the unlensed source is the inverse of the determinant of
$\mathcal{A}$. The fluxes observed from the image and from the unlensed source are given as
integrals over the brightness distributions $I(\vec\theta)$ and $I^{(s)}(\vec\beta)$, respectively, and their
ratio is the magnification $\mu(\vec\theta_0)$. From (3.13), we find

$$\mu = \frac{1}{\det\mathcal{A}} = \frac{1}{(1-\kappa)^2-|\gamma|^2} \;. \tag{3.14}$$

The images are thus distorted in shape and size. The shape distortion is due to
the tidal gravitational field, described by the shear $\gamma$, whereas the magnification
is caused by both isotropic focusing caused by the local matter density $\kappa$ and
anisotropic focusing caused by shear.
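The relations between the Jacobian matrix (3.11), its eigenvalues and the magnification (3.14) are easy to check numerically. The following is a minimal sketch with illustrative values of $\kappa$ and $\gamma$; the function names are hypothetical and not part of the original text.

```python
import math

def jacobian(kappa, gamma1, gamma2):
    """Jacobian matrix A of eq. (3.11) as a nested list."""
    return [[1.0 - kappa - gamma1, -gamma2],
            [-gamma2, 1.0 - kappa + gamma1]]

def magnification(kappa, gamma1, gamma2):
    """mu = 1 / det A, eq. (3.14)."""
    a = jacobian(kappa, gamma1, gamma2)
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return 1.0 / det

def axis_stretch(kappa, gamma1, gamma2):
    """Semi-axes of the image of a unit circle: inverse eigenvalues 1/(1 - kappa -+ |gamma|)."""
    g = math.hypot(gamma1, gamma2)
    return 1.0 / (1.0 - kappa - g), 1.0 / (1.0 - kappa + g)

kappa, gamma1, gamma2 = 0.1, 0.03, 0.04      # illustrative values with |gamma| = 0.05
mu = magnification(kappa, gamma1, gamma2)
major, minor = axis_stretch(kappa, gamma1, gamma2)
```

The product of the two stretch factors equals the magnification, which is the statement that the solid-angle ratio is the inverse determinant of $\mathcal{A}$.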

Since the shear is defined by the trace-free part of the symmetric Jacobian matrix
$\mathcal{A}$, it has two independent components. There exists a one-to-one mapping from
symmetric, trace-free $2\times2$ matrices onto complex numbers, and we shall extensively
use complex notation. Note that the shear transforms as $e^{2i\varphi}$ under rotations
of the coordinate frame, and is therefore not a vector. Equations (3.9) and (3.12)
imply that the complex shear can be written

$$\gamma(\vec\theta) = \frac{1}{\pi}\int_{\mathbb{R}^2} d^2\theta'\,\mathcal{D}(\vec\theta-\vec\theta')\,\kappa(\vec\theta') \;,$$

$$\text{with}\quad \mathcal{D}(\vec\theta) \equiv \frac{\theta_2^2-\theta_1^2-2i\theta_1\theta_2}{|\vec\theta|^4} = \frac{-1}{(\theta_1-i\theta_2)^2} \;. \tag{3.15}$$



3.1.4 Critical Curves and Caustics 

Points in the lens plane where the Jacobian $\mathcal{A}$ is singular, i.e. where $\det\mathcal{A} = 0$,
form closed curves, the critical curves. Their image curves in the source plane
are called caustics. Equation (3.14) predicts that sources on caustics are infinitely 
magnified; however, infinite magnification does not occur in reality, for two rea- 
sons. First, each astrophysical source is extended, and its magnification (given by 
the surface brightness-weighted point-source magnification across its solid angle) 
remains finite. Second, even point sources would be magnified by a finite value 
since for them, the geometrical-optics approximation fails near critical curves, 
and a wave-optics description leads to a finite magnification (e.g. Ohanian 1983; 
Schneider et al. 1992, Chap. 7). For the purposes of this review, the first effect al- 
ways dominates. Nevertheless, images near critical curves can be magnified and 
distorted substantially, as is demonstrated by the giant luminous arcs which are 
formed from source galaxies close to caustics. (Point) sources which move across a 
caustic have their number of images changed by ±2, and the two additional images 
appear or disappear at the corresponding critical curve in the lens plane. Hence, 
only sources inside a caustic are multiply imaged. 



3.1.5 An Illustrative Example: Isothermal Spheres 

The rotation curves of spiral galaxies are observed to be approximately flat out to
the largest radii where they can be measured. If the mass distribution in a spiral
galaxy followed the light distribution, the rotation curves would have to decrease at 
large radii in roughly Keplerian fashion. Flat rotation curves thus provide the clear- 
est evidence for dark matter on galactic scales. They can be understood if galactic 
disks are embedded in a dark halo with density profile $\rho \propto r^{-2}$ for large $r$. The
projected mass density then behaves like $\theta^{-1}$. Such density profiles are obtained
by assuming that the velocity dispersion of the dark matter particles is spatially 
constant. They are therefore also called isothermal profiles. We shall describe some 
simple properties of a gravitational lens with an isothermal mass profile, which 
shall later serve as a reference. 

The projected surface mass density of a singular isothermal sphere is

$$\Sigma(\xi) = \frac{\sigma_v^2}{2G\,\xi} \;, \tag{3.16}$$

where $\sigma_v$ is the line-of-sight velocity dispersion of the 'particles' (e.g. stars in
galaxies, or galaxies in clusters of galaxies) in the gravitational potential of the
mass distribution, assuming that they are in virial equilibrium. The corresponding
dimensionless surface mass density is

$$\kappa(\theta) = \frac{\theta_{\rm E}}{2\theta} \;,\quad\text{where}\quad \theta_{\rm E} = 4\pi\left(\frac{\sigma_v}{c}\right)^2\frac{D_{\rm ds}}{D_{\rm s}} \tag{3.17}$$

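is called the Einstein deflection angle. As can easily be verified from (3.8), the
magnitude of the scaled deflection angle is constant for this mass profile, $|\vec\alpha| = \theta_{\rm E}$,
and the deflection potential is $\psi = \theta_{\rm E}|\vec\theta|$. From that, the shear is obtained using
(3.12)^5,

$$\gamma(\vec\theta) = -\frac{\theta_{\rm E}}{2|\vec\theta|}\,e^{2i\varphi} \;, \tag{3.18}$$

and the magnification is

$$\mu(\vec\theta) = \frac{|\vec\theta|}{|\vec\theta|-\theta_{\rm E}} \;. \tag{3.19}$$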

This shows that $|\vec\theta| = \theta_{\rm E}$ defines a critical curve, which is called the Einstein circle.
The corresponding caustic, obtained by mapping the Einstein circle back into the
source plane under the lens equation, degenerates to a single point at $\vec\beta = \vec 0$. Such
degenerate caustics require highly symmetric lenses. Any perturbation of the mass
distribution breaks the degeneracy and expands the singular caustic point into a
caustic curve (see Chapter 6 in Schneider et al. 1992 for a detailed treatment of
critical curves and caustics). The lens (3.17) produces two images with angular
separation $2\theta_{\rm E}$ for a source with $|\vec\beta| < \theta_{\rm E}$, and one image otherwise.
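The properties of the singular isothermal sphere quoted above can be illustrated with a few lines of code. This is a sketch under the assumption that the lens equation is reduced to one dimension along the line joining lens centre and source, $\beta = \theta - \theta_{\rm E}\,{\rm sign}(\theta)$; the function names are hypothetical, and all angles are in the same arbitrary units.

```python
def sis_images(beta, theta_e):
    """Signed image positions for a source at offset beta >= 0 from the lens centre.

    Negative positions lie on the opposite side of the lens from the source.
    """
    if beta < theta_e:
        return [beta + theta_e, beta - theta_e]
    return [beta + theta_e]

def sis_magnification(theta, theta_e):
    """Signed magnification |theta| / (|theta| - theta_E) at a signed image position."""
    t = abs(theta)
    return t / (t - theta_e)

def sis_shear_magnitude(theta, theta_e):
    """|gamma| = (mean kappa inside theta) - kappa, with mean kappa = theta_E / |theta|."""
    t = abs(theta)
    return theta_e / t - theta_e / (2.0 * t)

images = sis_images(0.3, 1.0)                       # source inside the Einstein radius
mags = [sis_magnification(t, 1.0) for t in images]  # signed magnifications
single = sis_images(1.5, 1.0)                       # source outside: one image
```

For the example source the two images are separated by twice the Einstein angle, and the image on the far side of the lens has negative parity (negative signed magnification).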

The mass distribution (3.17) has two unsatisfactory properties. The surface mass
density diverges for $|\vec\theta| \to 0$, and the total mass of the lens is infinite. Clearly,
neither of these properties matches real mass distributions. Despite this fact,
the singular isothermal sphere fits many of the observed lens systems fairly well.
In order to construct a somewhat more realistic lens model, one can cut off the
distribution at small and large distances, e.g. by

$$\kappa(\vec\theta) = \frac{\theta_{\rm E}}{2\sqrt{|\vec\theta|^2+\theta_{\rm c}^2}} - \frac{\theta_{\rm E}}{2\sqrt{|\vec\theta|^2+\theta_{\rm t}^2}} \;, \tag{3.20}$$

which has a core radius $\theta_{\rm c}$, and a truncation radius $\theta_{\rm t}$. For $\theta_{\rm c} \ll |\vec\theta| \ll \theta_{\rm t}$, this
mass distribution behaves like $\theta^{-1}$. This lens can produce three images, but only
if $\theta_{\rm c}\theta_{\rm t}\,(\theta_{\rm c}+\theta_{\rm t})^{-1} < \theta_{\rm E}/2$. One of the three images occurs near the centre of the
lens and is strongly de-magnified if $\theta_{\rm c} \ll \theta_{\rm E}$. In most of the multiple-image QSO
lens systems, there is no indication for a third central image, imposing strict upper
bounds on $\theta_{\rm c}$, whereas for some arc systems in clusters, a finite core size is required
when a lens model like (3.20) is assumed.

^5 For axially-symmetric projected mass profiles, the magnitude of the shear can be
calculated from $|\gamma|(\theta) = \bar\kappa(\theta) - \kappa(\theta)$, where $\bar\kappa(\theta)$ is the mean surface mass density inside a
circle of radius $\theta$ from the lens centre. Accordingly, the magnitude of the deflection angle
is $|\vec\alpha| = \theta\,\bar\kappa(\theta)$.



3.2 Light Propagation in Arbitrary Space-times

We now turn to a more rigorous description of the propagation of light rays, based
on the theory of geometrical optics in General Relativity. We then specialise the 
resulting propagation equations to the case of weak gravitational fields and metric 
perturbations to the background of an expanding universe. These equations contain 
the gravitational lens equation discussed previously as a special case. We shall keep 
the discussion brief and follow closely the work of Schneider et al. (1992, Chaps. 3 
& 4), and Seitz et al. (1994), where further references can be found. 

3.2.1 Propagation of Light Bundles 

In Sect. 3.1.2, we have derived the lens equation (3.5) in a heuristic way. A
rigorous derivation in an arbitrary space-time must account for the fact that distance
vectors between null geodesics are four-vectors. Nevertheless, by choosing an appropriate
coordinate system, the separation transverse to the line-of-sight between
two neighbouring light rays can effectively be described by a two-dimensional vector
$\vec\xi$. We outline this operation in the following two paragraphs.

We first consider the propagation of infinitesimally thin light beams in an arbitrary
space-time, characterised by the metric tensor $g_{\mu\nu}$. The propagation of a fiducial ray
$\gamma_0$ of the bundle is determined by the geodesic equation (e.g. Misner et al. 1973;
Weinberg 1972). We are interested here in the evolution of the shape of the bundle
as a function of the affine parameter along the fiducial ray. Consider an observer
O with four-velocity $U_{\rm o}^\mu$, satisfying $U_{\rm o}^\mu U_{{\rm o}\mu} = 1$. The physical wave vector $k^\mu$ of a
photon depends on the light frequency. We define $\tilde k^\mu \equiv -c\,\omega_{\rm o}^{-1}k^\mu$ as a past-directed
dimensionless wave vector which is independent of the frequency $\omega_{\rm o}$ measured by
the observer. We choose an affine parameter $\lambda$ of the rays passing through O such
that (1) $\lambda = 0$ at the observer, (2) $\lambda$ increases along the backward light cone of O,
and (3) $U_{\rm o}^\mu\tilde k_\mu = -1$ at O. Then, with the definition of $\tilde k^\mu$ it follows that $\tilde k^\mu = dx^\mu/d\lambda$,
and that $\lambda$ measures the proper distance along light rays for events close to O.

Let $\gamma^\mu(\vec\theta,\lambda)$ characterise the rays of a light beam with vertex at O, such that $\vec\theta$
is the angle between a ray and the fiducial ray with $\gamma_0^\mu(\lambda) \equiv \gamma^\mu(\vec 0,\lambda)$. Further, let
$Y^\mu(\vec\theta,\lambda) = \gamma^\mu(\vec\theta,\lambda) - \gamma^\mu(\vec 0,\lambda) = [\partial\gamma^\mu(\vec\theta,\lambda)/\partial\theta_k]\,\theta_k$ denote the vector connecting the
ray characterised by $\vec\theta$ with the fiducial ray at the same affine parameter $\lambda$, where
we assumed sufficiently small $|\vec\theta|$ so that $Y^\mu$ can be linearised in $\vec\theta$. We can then
decompose $Y^\mu$ as follows. At O, the vectors $U_{\rm o}^\mu$ and $\tilde k^\mu$ define a two-dimensional
plane perpendicular to both $U_{\rm o}^\mu$ and $\tilde k^\mu$. This plane is tangent to the sphere of
directions seen by the observer. Now choose orthonormal unit vectors $E_1$ and $E_2$ to
span that plane. Hence, $E_1^\mu E_{2\mu} = 0$, $E_k^\mu E_{k\mu} = -1$, $E_k^\mu\tilde k_\mu = E_k^\mu U_{{\rm o}\mu} = 0$, for $k = 1,2$.
Transporting the four vectors $\tilde k^\mu$, $U_{\rm o}^\mu$, $E_1^\mu$, and $E_2^\mu$ parallel along the fiducial ray
defines a vierbein at each event along the fiducial ray. The deviation vector can then
be decomposed into

$$Y^\mu(\vec\theta,\lambda) = -\xi_1(\vec\theta,\lambda)\,E_1^\mu - \xi_2(\vec\theta,\lambda)\,E_2^\mu - \xi_0(\vec\theta,\lambda)\,\tilde k^\mu \;. \tag{3.21}$$

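Thus, the two-dimensional vector $\vec\xi(\vec\theta,\lambda)$ with components $\xi_{1,2}(\vec\theta,\lambda)$ describes the
transverse separation of two light rays at affine parameter $\lambda$, whereas $\xi_0$ allows for a
deviation component along the beam direction. Due to the linearisation introduced
above, $\vec\xi$ depends linearly on $\vec\theta$, and the choice of $\lambda$ assures that $d\vec\xi/d\lambda\,(\lambda = 0) = \vec\theta$.
Hence, we can write the linear propagation equation

$$\vec\xi(\lambda) = \mathcal{D}(\lambda)\,\vec\theta \;. \tag{3.22}$$

The $2\times2$ matrix $\mathcal{D}$ satisfies the Jacobi differential equation

$$\frac{d^2\mathcal{D}(\lambda)}{d\lambda^2} = \mathcal{T}(\lambda)\,\mathcal{D}(\lambda) \;, \tag{3.23}$$

with initial conditions

$$\mathcal{D}(0) = \mathcal{O} \quad\text{and}\quad \frac{d\mathcal{D}}{d\lambda}(0) = \mathcal{I} \;. \tag{3.24}$$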

The optical tidal matrix $\mathcal{T}(\lambda)$ is symmetric,

$$\mathcal{T}(\lambda) = \begin{pmatrix} \mathcal{R}(\lambda)+\Re[\mathcal{F}(\lambda)] & \Im[\mathcal{F}(\lambda)] \\ \Im[\mathcal{F}(\lambda)] & \mathcal{R}(\lambda)-\Re[\mathcal{F}(\lambda)] \end{pmatrix} \;, \tag{3.25}$$

and its components depend on the curvature of the metric. $\Re(z)$ and $\Im(z)$ denote
the real and imaginary parts of the complex number $z$. Specifically,

$$\mathcal{R}(\lambda) = -\frac{1}{2}\,R_{\mu\nu}(\lambda)\,\tilde k^\mu(\lambda)\,\tilde k^\nu(\lambda) \;, \tag{3.26}$$

where $R_{\mu\nu}(\lambda)$ is the Ricci tensor at $\gamma_0^\mu(\lambda)$. The complex quantity $\mathcal{F}(\lambda)$ is more
complicated and depends on the Weyl curvature tensor at $\gamma_0^\mu(\lambda)$. The source of
convergence $\mathcal{R}(\lambda)$ leads to an isotropic focusing of light bundles, in that a circular light
beam continues to have a circular cross section. In contrast, a non-zero source of
shear $\mathcal{F}(\lambda)$ causes an anisotropic focusing, changing the shape of the light bundle.
For a similar set of equations, see, e.g. Blandford et al. (1991) and Peebles (1993).

To summarise this subsection, the transverse separation vector $\vec\xi$ of two infinitesimally
close light rays, enclosing an angle $\vec\theta$ at the observer, depends linearly on $\vec\theta$.
The matrix which describes this linear mapping is obtained from the Jacobi differential
equation (3.23). The optical tidal matrix $\mathcal{T}$ can be calculated from the metric.
This exact result from General Relativity is of course not easily applied to practical
calculations in general space-times, as one first has to calculate the null geodesic
$\gamma_0^\mu(\lambda)$, and from that the components of the tidal matrix have to be determined.
However, as we shall show next, the equations attain rather simple forms in the
case of weak gravitational fields.
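To make the Jacobi equation (3.23) with initial conditions (3.24) more concrete, the sketch below integrates it with a classical fourth-order Runge-Kutta scheme for a toy tidal matrix. The constant, diagonal tidal matrix (a real source of shear and a negative source of convergence), the step number and all function names are assumptions made purely for illustration; for this toy case the solution is known in closed form, $\mathcal{D}_{ii} = \sin(\sqrt{-\mathcal{T}_{ii}}\,\lambda)/\sqrt{-\mathcal{T}_{ii}}$.

```python
import math

def mat_mul(a, b):
    """Product of two 2x2 matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def axpy(a, x, y):
    """Return a*x + y for 2x2 matrices x, y and a scalar a."""
    return [[a * x[i][j] + y[i][j] for j in range(2)] for i in range(2)]

def integrate_jacobi(tidal, lam_max, steps=2000):
    """Integrate D'' = T(lambda) D with D(0) = 0 and D'(0) = 1 by classical RK4."""
    h = lam_max / steps
    d = [[0.0, 0.0], [0.0, 0.0]]             # D(0) = O
    v = [[1.0, 0.0], [0.0, 1.0]]             # dD/dlambda(0) = I
    lam = 0.0
    for _ in range(steps):
        # RK4 stages for the first-order system (D, V)' = (V, T D)
        k1d, k1v = v, mat_mul(tidal(lam), d)
        d2, v2 = axpy(h / 2, k1d, d), axpy(h / 2, k1v, v)
        k2d, k2v = v2, mat_mul(tidal(lam + h / 2), d2)
        d3, v3 = axpy(h / 2, k2d, d), axpy(h / 2, k2v, v)
        k3d, k3v = v3, mat_mul(tidal(lam + h / 2), d3)
        d4, v4 = axpy(h, k3d, d), axpy(h, k3v, v)
        k4d, k4v = v4, mat_mul(tidal(lam + h), d4)
        d_next = [[d[i][j] + h / 6 * (k1d[i][j] + 2 * k2d[i][j] + 2 * k3d[i][j] + k4d[i][j])
                   for j in range(2)] for i in range(2)]
        v_next = [[v[i][j] + h / 6 * (k1v[i][j] + 2 * k2v[i][j] + 2 * k3v[i][j] + k4v[i][j])
                   for j in range(2)] for i in range(2)]
        d, v = d_next, v_next
        lam += h
    return d

def toy_tidal(lam, conv=-1.0, shear=0.5):
    """Toy tidal matrix: constant source of convergence and real source of shear."""
    return [[conv + shear, 0.0], [0.0, conv - shear]]

d_final = integrate_jacobi(toy_tidal, 1.0)
```

Because the two diagonal entries of the toy tidal matrix differ, the two diagonal components of the solution differ as well, i.e. an initially circular beam acquires an elliptical cross section.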



3.2.2 Specialisation to Weak Gravitational Fields 

We shall now specialise the transport equation (3.23) to the situation of a homogeneous
and isotropic universe, and to weak gravitational fields. In a metric of the
Robertson-Walker type (2.2, page 13), the source of shear $\mathcal{F}$ must vanish identically
because of isotropy; otherwise preferred directions would exist. Initially circular
light bundles therefore remain circular. Hence, the optical tidal matrix $\mathcal{T}$ is
proportional to the unit matrix, $\mathcal{T}(\lambda) = \mathcal{R}(\lambda)\,\mathcal{I}$, and the solution of (3.23) must be
of the form $\mathcal{D}(\lambda) = D(\lambda)\,\mathcal{I}$. According to (3.22), the function $D(\lambda)$ is the angular-diameter
distance as a function of the affine parameter. As we shall demonstrate
next, this function indeed agrees with the angular diameter distance as defined in
(2.43, page 22).

To do so, we first have to find $\mathcal{R}(\lambda)$. The Ricci tensor deviates from the Einstein
tensor by two terms proportional to the metric tensor $g_{\mu\nu}$, one involving the Ricci
scalar, the other containing the cosmological constant. These two terms do not
contribute to (3.26), since $\tilde k^\mu$ is a null vector. We can thus replace the Ricci tensor in
(3.26) by the energy-momentum tensor according to Einstein's field equation. Since
$k^0 = c^{-1}\omega = (1+z)\,c^{-1}\omega_{\rm o}$, we have $\tilde k^0 = -(1+z)$, and the spatial components of
$\tilde k^\mu$ are described by a direction and the constraint that $\tilde k^\mu$ is a null vector. Then,
using the energy-momentum tensor of a perfect fluid with density $\rho$ and pressure $p$,
(3.26) becomes

$$\mathcal{R}(\lambda) = -\frac{4\pi G}{c^2}\left(\rho+\frac{p}{c^2}\right)(1+z)^2 \;. \tag{3.27}$$

Specialising to a universe filled with dust, i.e. $p = 0$, we find from (2.16, page 16)
and (2.19, page 16)

$$\mathcal{R}(\lambda) = -\frac{3}{2}\left(\frac{H_0}{c}\right)^2\Omega_0\,(1+z)^5 \;. \tag{3.28}$$

The transport equation (3.23) then transforms to

$$\frac{d^2D}{d\lambda^2} = -\frac{3}{2}\left(\frac{H_0}{c}\right)^2\Omega_0\,(1+z)^5\,D \;. \tag{3.29}$$

In order to show that the solution of (3.29) with initial conditions $D = 0$ and $dD = d\lambda$
at $\lambda = 0$ is equivalent to (2.43, page 22), we proceed as follows. First we note
that (2.43) for $z_1 = 0$ can be written as an initial-value problem,

$$\frac{d^2}{dw^2}\left(\frac{D_{\rm ang}}{a}\right) = -K\left(\frac{D_{\rm ang}}{a}\right) \;, \tag{3.30}$$

with $D_{\rm ang}(0) = 0$ and $dD_{\rm ang} = dw$ at $w = 0$, because of the properties of the function
$f_K$; cf. (2.4, page 14). Next, we need a relation between $\lambda$ and $w$. The null
component of the photon geodesic is $x^0 = c\,(t_0 - t)$. Then, from $dx^0 = \tilde k^0\,d\lambda$, we
obtain $d\lambda = -a\,c\,dt$. Using $dt = \dot a^{-1}\,da$, we find

$$da = -\frac{\dot a}{c\,a}\,d\lambda \;,\quad\text{or}\quad dz = \frac{\dot a}{c\,a^3}\,d\lambda \;. \tag{3.31}$$

Since $c\,dt = -a\,dw$ for null rays, we have $\dot a^{-1}\,da = dt = -a\,c^{-1}\,dw$, which can be
combined with (3.31) to yield

$$d\lambda = a^2\,dw \;. \tag{3.32}$$
We can now calculate the analogous expression of (3.30) for $D$,

$$\frac{d^2}{dw^2}\left(\frac{D}{a}\right) = a^2\,\frac{d}{d\lambda}\left[a^2\,\frac{d}{d\lambda}\left(\frac{D}{a}\right)\right] = a^3 D'' - a^2 a'' D \;, \tag{3.33}$$

where a prime denotes differentiation with respect to $\lambda$. From (3.31), $a' = -(ac)^{-1}\dot a$, and

$$a'' = \frac{1}{2}\,\frac{d(a')^2}{da} = \frac{1}{2c^2}\,\frac{d}{da}\left(\frac{\dot a^2}{a^2}\right) = \frac{1}{2c^2}\,\frac{dH^2}{da} \;, \tag{3.34}$$

with $H$ given in (2.31, page 19). Substituting (3.29) into the first term on the right-hand
side of (3.33), and (3.34) into the second term, we immediately see that $D$
satisfies the differential equation (3.30). Since $D$ has the same initial conditions as
$D_{\rm ang}$, they indeed agree.

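For computational convenience, we can also transform (3.29) into a differential
equation for $D(z)$. Using (3.31) and (2.31), one finds

$$(1+z)\left[(1+\Omega_0 z)-\Omega_\Lambda\left(1-\frac{1}{(1+z)^2}\right)\right]\frac{d^2D}{dz^2} + \left[\frac{7}{2}\Omega_0 z+\frac{\Omega_0}{2}+3-\Omega_\Lambda\left(3-\frac{2}{(1+z)^2}\right)\right]\frac{dD}{dz} + \frac{3}{2}\Omega_0 D = 0 \;. \tag{3.35}$$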
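A differential equation of the form (3.35) for $D(z)$ lends itself to direct numerical integration. The sketch below is an illustration rather than part of the original text: it uses a classical Runge-Kutta scheme, measures $D$ in units of $cH_0^{-1}$, and starts from $D(0) = 0$ and $dD/dz(0) = 1$, which follows from $dD = d\lambda$ and $dz = (H_0/c)\,d\lambda$ at the observer. The function names and the step number are assumptions. Two standard closed forms serve as checks: the Einstein-de Sitter case ($\Omega_0 = 1$, $\Omega_\Lambda = 0$) and the empty universe ($\Omega_0 = \Omega_\Lambda = 0$).

```python
def angular_diameter_distance(z_max, omega0, omega_lambda, steps=2000):
    """Integrate the second-order equation for D(z) from z = 0 by classical RK4.

    D is returned in units of c/H0 (an assumption of this sketch).
    """
    def rhs(z, d, dp):
        zp1 = 1.0 + z
        # coefficient of D'' and of D' in the differential equation
        a = zp1 * ((1.0 + omega0 * z) - omega_lambda * (1.0 - 1.0 / zp1 ** 2))
        b = 3.5 * omega0 * z + 0.5 * omega0 + 3.0 - omega_lambda * (3.0 - 2.0 / zp1 ** 2)
        return dp, -(b * dp + 1.5 * omega0 * d) / a

    h = z_max / steps
    z, d, dp = 0.0, 0.0, 1.0                 # D(0) = 0, dD/dz(0) = 1 in units of c/H0
    for _ in range(steps):
        k1d, k1p = rhs(z, d, dp)
        k2d, k2p = rhs(z + h / 2, d + h / 2 * k1d, dp + h / 2 * k1p)
        k3d, k3p = rhs(z + h / 2, d + h / 2 * k2d, dp + h / 2 * k2p)
        k4d, k4p = rhs(z + h, d + h * k3d, dp + h * k3p)
        d += h / 6 * (k1d + 2 * k2d + 2 * k3d + k4d)
        dp += h / 6 * (k1p + 2 * k2p + 2 * k3p + k4p)
        z += h
    return d

def eds_distance(z):
    """Closed form for Omega_0 = 1, Omega_Lambda = 0, in units of c/H0."""
    return 2.0 * ((1.0 + z) ** -1 - (1.0 + z) ** -1.5)

def empty_distance(z):
    """Closed form for Omega_0 = Omega_Lambda = 0, in units of c/H0."""
    return 0.5 * (1.0 - (1.0 + z) ** -2)

d_eds = angular_diameter_distance(1.0, 1.0, 0.0)
d_empty = angular_diameter_distance(2.0, 0.0, 0.0)
```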
We next turn to the case of a weak isolated mass inhomogeneity with a spatial extent
small compared to the Hubble distance $cH_0^{-1}$, like galaxies or clusters of galaxies.
In that case, the metric can locally be approximated by the post-Minkowskian line
element

$$ds^2 = \left(1+\frac{2\Phi}{c^2}\right)c^2\,dt^2 - \left(1-\frac{2\Phi}{c^2}\right)d\vec x^2 \;, \tag{3.36}$$

where $d\vec x^2$ is the line element of Euclidian three-space, and $\Phi$ is the Newtonian
gravitational potential which is assumed to be weak, $\Phi \ll c^2$. Calculating the curvature
tensor of the metric (3.36), and using Poisson's equation for $\Phi$, we find that
for a light ray which propagates into the three-direction, the sources of convergence
and shear are

$$\mathcal{R} = -\frac{4\pi G}{c^2}\,\rho \;,\quad \mathcal{F} = -\frac{1}{c^2}\left(\Phi_{,11}-\Phi_{,22}+2i\,\Phi_{,12}\right) \;. \tag{3.37}$$



Now the question is raised as to how an isolated inhomogeneity can be combined
with the background model of an expanding universe. There is no exact solution
of Einstein's field equations which describes a universe with density fluctuations,
with the exception of a few very special cases such as the Swiss-Cheese model
(Einstein & Strauss 1945). We therefore have to resort to approximation methods
which start from identifying 'small' parameters of the problem, and expanding the
relevant quantities into a Taylor series in these parameters. If the length scales of
density inhomogeneities are much smaller than the Hubble length $cH_0^{-1}$, the associated
Newtonian gravitational potential satisfies $\Phi \ll c^2$ (note that this does not imply that
the relative density fluctuations are small!), and the peculiar velocities satisfy $v \ll c$, then
an approximate metric is

$$ds^2 = a^2(\tau)\left[\left(1+\frac{2\Phi}{c^2}\right)c^2\,d\tau^2 - \left(1-\frac{2\Phi}{c^2}\right)\left(dw^2+f_K^2(w)\,d\omega^2\right)\right] \;, \tag{3.38}$$

where $d\tau = a^{-1}\,dt$ is the conformal time element, and $\Phi$ satisfies Poisson's equation
with source $\Delta\rho$, the density enhancement or reduction relative to the mean cosmic
density (Futamase 1989; Futamase & Sasaki 1989; Jacobs et al. 1993).

In the case of weak metric perturbations, the sources of convergence and shear of
the background metric and the perturbations can be added. Recalling that both $\mathcal{R}$
and $\mathcal{F}$ are quadratic in $\tilde k^\mu \propto (1+z)$ so that the expressions in (3.37) have to be
multiplied by $(1+z)^2$, we find for the optical tidal matrix

$$\mathcal{T}_{ij}(\lambda) = -\frac{3}{2}\left(\frac{H_0}{c}\right)^2\Omega_0\,(1+z)^5\,\delta_{ij} - \frac{(1+z)^2}{c^2}\left(2\Phi_{,ij}+\delta_{ij}\,\Phi_{,33}\right) \;, \tag{3.39}$$

where we have assumed that the local Cartesian coordinates are chosen such that
the light ray propagates in $x_3$-direction. The same result is obtained from the metric
(3.38).
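As a consistency check, the perturbation part of the optical tidal matrix (3.39) can be worked out component by component. The sketch below assumes that the symmetric tidal matrix has the components $\mathcal{T}_{11} = \mathcal{R}+\Re\mathcal{F}$, $\mathcal{T}_{22} = \mathcal{R}-\Re\mathcal{F}$ and $\mathcal{T}_{12} = \Im\mathcal{F}$, and uses Poisson's equation $\nabla^2\Phi = 4\pi G\rho$ to write the source of convergence in (3.37) as $\mathcal{R} = -c^{-2}\,\nabla^2\Phi$. The common factor $(1+z)^2$ is omitted for brevity.

```latex
\begin{align*}
\mathcal{T}_{11} &= -\frac{1}{c^2}\left(\Phi_{,11}+\Phi_{,22}+\Phi_{,33}\right)
                    -\frac{1}{c^2}\left(\Phi_{,11}-\Phi_{,22}\right)
                  = -\frac{1}{c^2}\left(2\Phi_{,11}+\Phi_{,33}\right) , \\
\mathcal{T}_{22} &= -\frac{1}{c^2}\left(\Phi_{,11}+\Phi_{,22}+\Phi_{,33}\right)
                    +\frac{1}{c^2}\left(\Phi_{,11}-\Phi_{,22}\right)
                  = -\frac{1}{c^2}\left(2\Phi_{,22}+\Phi_{,33}\right) , \\
\mathcal{T}_{12} &= \Im\mathcal{F} = -\frac{2}{c^2}\,\Phi_{,12} .
\end{align*}
```

All three components agree with $-c^{-2}\left(2\Phi_{,ij}+\delta_{ij}\,\Phi_{,33}\right)$ for $i,j \in \{1,2\}$.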
The lens equation as discussed in Sect. 3.1 can now be derived from the previous
relations. To do so, one has to assume a geometrically thin matter distribution, i.e. one
approximates the density perturbation $\Delta\rho$ by a distribution which is infinitely thin
in the direction of photon propagation. It is then characterised by its surface mass
density $\Sigma(\vec\xi)$. The corresponding Newtonian potential $\Phi$ can then be inserted into
(3.39). The integration over $\Phi_{,33}$ along the light ray vanishes, and (3.23) can be
employed to calculate the change of $d\mathcal{D}/d\lambda$ across the thin matter sheet (the lens
plane), whereas the components of $\mathcal{D}$ far from the lens plane are given by a linear
combination of solutions of the transport equation (3.29). Continuity and the
change of derivative at $\lambda_{\rm d}$, corresponding to the lens redshift $z_{\rm d}$, then uniquely fix
the solution. If $\mathcal{D}(\vec\theta,\lambda_{\rm s})$ denotes the solution at redshift $z_{\rm s}$, then $\mathcal{D}(\vec\theta,\lambda_{\rm s}) = \partial\vec\eta/\partial\vec\theta$
in the notation of Sect. 3.1. Line integration of this relation then leads to the lens
equation (3.2). See Seitz et al. (1994) for details, and Pyne & Birkinshaw (1996)
for an alternative derivation.

4 Principles of Weak Gravitational Lensing

4.1 Introduction 

If the faint, and presumably distant, galaxy population is observed through the grav- 
itational field of a deflector, the appearance of the galaxies is changed. The tidal 
component of the gravitational field distorts the shapes of galaxy images, and the 
magnification associated with gravitational light deflection changes their apparent 
brightness. If all galaxies were intrinsically circular, any galaxy image would im- 
mediately provide information on the local tidal gravitational field. With galaxies 
being intrinsically elliptical, the extraction of significant information from individ- 
ual images is impossible, except for giant luminous arcs (see Fig. 10, page 37, for 
an example) whose distortion is so extreme that it can easily be determined. 

However, assuming that the galaxies are intrinsically randomly oriented^6, the
strength of the tidal gravitational field can be inferred from a sample of galaxy 
images, provided its net ellipticity surmounts the Poisson noise caused by the finite 
number of galaxy images in the sample and by the intrinsic ellipticity distribution. 

Since lensing conserves surface brightness, magnification increases the size of 
galaxy images at a fixed surface-brightness level. The resulting flux enhancement 
enables galaxies to be seen down to fainter intrinsic magnitudes, and consequently 
the local number density of galaxy images above a certain flux threshold can be 
altered by lensing. 

In this section, we introduce the principles of weak gravitational lensing. In 
Sect. 4.2, we present the laws of the transformation between source and image 
ellipticities and sizes, and in particular we introduce a convenient definition of the 
ellipticity of irregularly-shaped objects. Sect. 4.3 focuses on the determination of 
the local tidal gravitational field from an ensemble of galaxy images. We derive 
practical estimators for the shear and compare their relative merits. The effects of 
magnification on the observed galaxy images are discussed in Sect. 4.4. We de- 
rive an estimate for the detectability of a deflector from its weak-lensing imprint 
on galaxy-image ellipticities in Sect. 4.5, and the final subsection 4.6 is concerned 
with practical aspects of the measurement of galaxy ellipticities. 

^6 This assumption is not seriously challenged. Whereas galaxies in a cluster may have
non-random orientations relative to the cluster centre, or pairs of galaxies may be aligned 
due to mutual tidal interaction, the faint galaxies used for lensing studies are distributed 
over a large volume enclosed by a narrow cone with opening angle selected by the angular 
resolution of the mass reconstruction (see below) and length comparable to the Hubble 
radius, since the redshift distribution of faint galaxies is fairly broad. Thus, the faint galaxies 
typically have large spatial separations, which is also reflected by their weak two-point 
angular auto-correlation (Brainerd et al. 1995; Villumsen et al. 1997).

4.2 Galaxy Shapes and Sizes, and their Transformation



If a galaxy had elliptical isophotes, its shape and size could simply be defined in
terms of axis ratio and area enclosed by a boundary isophote. However, the shapes
of faint galaxies can be quite irregular and not well approximated by ellipses. In ad- 
dition, observed galaxy images are given in terms of pixel brightness on CCDs. We 
therefore require a definition of size and shape which accounts for the irregularity 
of images, and which is well adapted to observational data. 

Let $I(\vec\theta)$ be the surface brightness of a galaxy image at angular position $\vec\theta$. We first
assume that the galaxy image is isolated, so that $I$ can be measured to large angular
separations from the centre $\bar{\vec\theta}$ of the image,

$$\bar{\vec\theta} \equiv \frac{\int d^2\theta\;q_I[I(\vec\theta)]\,\vec\theta}{\int d^2\theta\;q_I[I(\vec\theta)]} \;, \tag{4.1}$$

where $q_I(I)$ is a suitably chosen weight function. For instance, if $q_I(I) = {\rm H}(I-I_{\rm th})$
is the Heaviside step function, $\bar{\vec\theta}$ is the centre of the area enclosed by a limiting
isophote $I = I_{\rm th}$. Alternatively, if $q_I(I) = I$, $\bar{\vec\theta}$ is the centre of light. As a third
example, if $q_I(I) = I\,{\rm H}(I-I_{\rm th})$, $\bar{\vec\theta}$ is the centre of light within the limiting isophote
$I = I_{\rm th}$. Having chosen $q_I(I)$, we define the tensor of second brightness moments,

$$Q_{ij} = \frac{\int d^2\theta\;q_I[I(\vec\theta)]\,(\theta_i-\bar\theta_i)(\theta_j-\bar\theta_j)}{\int d^2\theta\;q_I[I(\vec\theta)]} \;,\quad i,j \in \{1,2\} \;, \tag{4.2}$$

(e.g. Blandford et al. 1991). In writing (4.1) and (4.2), we implicitly assumed that
$q_I(I)$ is chosen such that the integrals converge. We can now define the size of an
image in terms of the two invariants of the symmetric tensor $Q$. For example, we
can define the size by

$$\omega = \left(Q_{11}Q_{22}-Q_{12}^2\right)^{1/2} \;, \tag{4.3}$$

so that it is proportional to the solid angle enclosed by the limiting isophote if $q_I(I)$
is a step function. We quantify the shape of the image by the complex ellipticity

$$\chi \equiv \frac{Q_{11}-Q_{22}+2i\,Q_{12}}{Q_{11}+Q_{22}} \;. \tag{4.4}$$

If the image has elliptical isophotes with axis ratio $r < 1$, then $\chi = (1-r^2)(1+r^2)^{-1}\exp(2i\vartheta)$,
where the phase of $\chi$ is twice the position angle $\vartheta$ of the major
axis. This definition assures that the complex ellipticity is unchanged if the galaxy
image is rotated by $\pi$, for this rotation leaves an ellipse unchanged.
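The moment-based definitions above translate directly into sums over pixels. The following sketch is an illustration only: the pixel representation as `(x, y, I)` tuples, the function names and the synthetic test image (an elliptical Gaussian with axis ratio $r = 1/2$ and position angle $30^\circ$) are assumptions, and the weight function defaults to $q_I(I) = I$. For this test image one expects $|\chi| = (1-r^2)/(1+r^2) = 0.6$ with a phase of twice the position angle.

```python
import cmath
import math

def brightness_moments(pixels, weight=lambda i: i):
    """Centre and second brightness moments of a pixelised image.

    pixels : list of (x, y, I) tuples (pixel centre and surface brightness)
    weight : the weight function q_I(I); the default gives the centre of light
    """
    norm = sum(weight(i) for _, _, i in pixels)
    cx = sum(weight(i) * x for x, _, i in pixels) / norm
    cy = sum(weight(i) * y for _, y, i in pixels) / norm
    q11 = sum(weight(i) * (x - cx) ** 2 for x, _, i in pixels) / norm
    q22 = sum(weight(i) * (y - cy) ** 2 for _, y, i in pixels) / norm
    q12 = sum(weight(i) * (x - cx) * (y - cy) for x, y, i in pixels) / norm
    return (cx, cy), (q11, q22, q12)

def size_and_chi(q11, q22, q12):
    """Size omega and complex ellipticity chi from the second moments."""
    omega = math.sqrt(q11 * q22 - q12 ** 2)
    chi = complex(q11 - q22, 2.0 * q12) / (q11 + q22)
    return omega, chi

# Synthetic image: elliptical Gaussian, semi-axes 2 and 1, centred on (1.0, -0.5)
angle = math.radians(30.0)
pixels = []
for ix in range(-48, 49):
    for iy in range(-48, 49):
        x, y = 0.25 * ix, 0.25 * iy
        u = (x - 1.0) * math.cos(angle) + (y + 0.5) * math.sin(angle)
        v = -(x - 1.0) * math.sin(angle) + (y + 0.5) * math.cos(angle)
        pixels.append((x, y, math.exp(-u * u / 8.0 - v * v / 2.0)))

centre, moments = brightness_moments(pixels)
omega, chi = size_and_chi(*moments)
```

Real images additionally suffer from noise, neighbouring objects and a finite measurement aperture, none of which this sketch attempts to model.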



If we define the centre of the source $\bar{\vec\beta}$ and the tensor of second brightness moments
$Q^{(s)}_{ij}$ of the source in complete analogy to that of the image, i.e. with $I(\vec\theta)$ replaced
by $I^{(s)}(\vec\beta)$ in eqs. (4.1) and (4.2), and employ the conservation of surface brightness
(3.10, page 49) and the linearised lens equation (3.13, page 49), we find that the
tensors of second brightness moments of source and image are related through

$$Q^{(s)} = \mathcal{A}\,Q\,\mathcal{A}^T = \mathcal{A}\,Q\,\mathcal{A} \;, \tag{4.5}$$

where $\mathcal{A} \equiv \mathcal{A}(\bar{\vec\theta})$ is the Jacobian matrix of the lens equation at position $\bar{\vec\theta}$. Defining
further the complex ellipticity of the source $\chi^{(s)}$ in analogy to (4.4) in terms of $Q^{(s)}$,
ellipticities transform according to

$$\chi^{(s)} = \frac{\chi-2g+g^2\chi^*}{1+|g|^2-2\Re(g\chi^*)} \tag{4.6}$$

(Schneider & Seitz 1995; similar transformation formulae were previously derived
by Kochanek 1990 and Miralda-Escude 1991b), where the asterisk denotes complex
conjugation, and $g$ is the reduced shear

$$g(\vec\theta) \equiv \frac{\gamma(\vec\theta)}{1-\kappa(\vec\theta)} \;. \tag{4.7}$$

The inverse transformation is obtained by interchanging $\chi$ and $\chi^{(s)}$ and replacing
$g$ by $-g$ in (4.6). Equation (4.6) shows that the transformation of image ellipticities
depends only on the reduced shear, and not on the shear and the surface mass
density individually. Hence, the reduced shear or functions thereof are the only
quantities accessible through measurements of image ellipticities. This can also
immediately be seen by writing $\mathcal{A}$ as

$$\mathcal{A} = (1-\kappa)\begin{pmatrix} 1-g_1 & -g_2 \\ -g_2 & 1+g_1 \end{pmatrix} \;. \tag{4.8}$$

The pre-factor $(1-\kappa)$ only affects the size, but not the shape of the images. From
(4.5) and (4.3), we immediately see that the sizes of source and image are related
through

$$\omega = \mu(\vec\theta)\,\omega^{(s)} \;. \tag{4.9}$$
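The transformation law for the complex ellipticity $\chi$ can be cross-checked against the transformation of the second brightness moments, $Q^{(s)} = \mathcal{A}\,Q\,\mathcal{A}$, from which it is derived. The sketch below does this for one arbitrary set of numbers; the function names and the chosen values of $\kappa$, $\gamma$ and the image moments are illustrative assumptions.

```python
import math

def chi_from_moments(q11, q22, q12):
    """Complex ellipticity chi from second moments."""
    return complex(q11 - q22, 2.0 * q12) / (q11 + q22)

def source_chi(chi, g):
    """Source ellipticity from image ellipticity chi and reduced shear g."""
    num = chi - 2.0 * g + g * g * chi.conjugate()
    den = 1.0 + abs(g) ** 2 - 2.0 * (g * chi.conjugate()).real
    return num / den

def source_moments(q11, q22, q12, kappa, gamma):
    """Q^(s) = A Q A for the symmetric Jacobian A built from kappa and complex gamma."""
    a11 = 1.0 - kappa - gamma.real
    a22 = 1.0 - kappa + gamma.real
    a12 = -gamma.imag
    # B = A Q, written out for 2x2 matrices
    b11 = a11 * q11 + a12 * q12
    b12 = a11 * q12 + a12 * q22
    b21 = a12 * q11 + a22 * q12
    b22 = a12 * q12 + a22 * q22
    # Q^(s) = B A
    s11 = b11 * a11 + b12 * a12
    s12 = b11 * a12 + b12 * a22
    s22 = b21 * a12 + b22 * a22
    return s11, s22, s12

kappa, gamma = 0.2, complex(0.1, 0.05)       # illustrative lens properties
g = gamma / (1.0 - kappa)                    # reduced shear
q_image = (3.0, 1.5, 0.7)                    # illustrative image moments (Q11, Q22, Q12)
q_source = source_moments(*q_image, kappa, gamma)

chi_direct = chi_from_moments(*q_source)
chi_formula = source_chi(chi_from_moments(*q_image), g)

mu = 1.0 / ((1.0 - kappa) ** 2 - abs(gamma) ** 2)
omega_image = math.sqrt(q_image[0] * q_image[1] - q_image[2] ** 2)
omega_source = math.sqrt(q_source[0] * q_source[1] - q_source[2] ** 2)
```

Both routes give the same source ellipticity, and the sizes of image and source differ by the magnification, as expected for a lens with $\det\mathcal{A} > 0$.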



We point out that the dimension-less surface mass density $\kappa$, and therefore also
the shear $\gamma$, depend not only on the redshift of the lens, but also on the redshift
of the sources, because the critical surface mass density (3.7, page 48) involves
the source redshift. More precisely, for fixed lens redshift $z_{\rm d}$, the lens strength is
proportional to the distance ratio $D_{\rm ds}/D_{\rm s}$. This implies that the transformation (4.6)
generally also depends on source redshift. We shall return to these redshift effects
in Sect. 4.3, and assume for now that the lens redshift is sufficiently small so that
the ratio $D_{\rm ds}/D_{\rm s}$ is approximately the same for all faint galaxy images.

Instead of $\chi$, we can define different ellipticity parameters (see
Bonnet & Mellier 1995). One of these definitions turns out to be quite useful,
namely

$$\epsilon \equiv \frac{Q_{11}-Q_{22}+2i\,Q_{12}}{Q_{11}+Q_{22}+2\left(Q_{11}Q_{22}-Q_{12}^2\right)^{1/2}} \;, \tag{4.10}$$

which we shall also call complex ellipticity. (Since we shall use the notation $\chi$ and
$\epsilon$ consistently throughout this article, there should be no confusion from using the
same name for two different quantities.) $\epsilon$ has the same phase as $\chi$, and for elliptical
isophotes with axis ratio $r < 1$, $|\epsilon| = (1-r)(1+r)^{-1}$. $\epsilon$ and $\chi$ are related through

$$\epsilon = \frac{\chi}{1+\left(1-|\chi|^2\right)^{1/2}} \;,\quad \chi = \frac{2\epsilon}{1+|\epsilon|^2} \;. \tag{4.11}$$

The transformation between source and image ellipticity in terms of $\epsilon$ is given by

$$\epsilon^{(s)} = \begin{cases} \dfrac{\epsilon-g}{1-g^*\epsilon} & \text{for}\ |g| \le 1 \\[2ex] \dfrac{1-g\,\epsilon^*}{\epsilon^*-g^*} & \text{for}\ |g| > 1 \end{cases} \tag{4.12}$$

(Seitz & Schneider 1997), and the inverse transformation is obtained by interchanging
$\epsilon$ and $\epsilon^{(s)}$ and replacing $g$ by $-g$ in (4.12). Although the transformation of
$\epsilon$ appears more complicated because of the case distinction, we shall see in the next
subsection that it is often useful to work in terms of $\epsilon$ rather than $\chi$; cf. eq. (4.17)
below.

For the case of weak lensing, which we define for the purpose of this section by $\kappa \ll 1$,
$|\gamma| \ll 1$, and thus $|g| \ll 1$, (4.12) becomes $\epsilon \approx \epsilon^{(s)}+g$, provided $|\epsilon| \approx |\epsilon^{(s)}| \lesssim 1/2$.
Likewise, eq. (4.6) simplifies to $\chi \approx \chi^{(s)}+2g$ in this case.
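The transformation of the ellipticity parameter $\epsilon$, its inverse, and the conversion between $\epsilon$ and $\chi$ can be written compactly with complex numbers. The sketch below is illustrative; the function names are hypothetical, and the inverse transformation is obtained as described in the text, by interchanging source and image ellipticity and replacing $g$ by $-g$.

```python
def chi_to_eps(chi):
    """Convert chi to epsilon: eps = chi / (1 + sqrt(1 - |chi|^2))."""
    return chi / (1.0 + (1.0 - abs(chi) ** 2) ** 0.5)

def eps_to_chi(eps):
    """Convert epsilon to chi: chi = 2 eps / (1 + |eps|^2)."""
    return 2.0 * eps / (1.0 + abs(eps) ** 2)

def source_eps(eps, g):
    """Source ellipticity from image ellipticity, with the case distinction in |g|."""
    if abs(g) <= 1.0:
        return (eps - g) / (1.0 - g.conjugate() * eps)
    return (1.0 - g * eps.conjugate()) / (eps.conjugate() - g.conjugate())

def image_eps(eps_s, g):
    """Inverse transformation: swap the ellipticities and replace g by -g."""
    if abs(g) <= 1.0:
        return (eps_s + g) / (1.0 + g.conjugate() * eps_s)
    return (1.0 + g * eps_s.conjugate()) / (eps_s.conjugate() + g.conjugate())

eps_image = complex(0.3, -0.2)               # illustrative image ellipticity
g_weak = complex(0.2, 0.1)                   # |g| < 1
g_strong = complex(1.5, -0.5)                # |g| > 1

round_trip_weak = image_eps(source_eps(eps_image, g_weak), g_weak)
round_trip_strong = image_eps(source_eps(eps_image, g_strong), g_strong)

# Weak-lensing limit: eps is close to eps_s + g for small |g|
g_small, eps_s = complex(0.01, 0.005), complex(0.2, 0.1)
weak_limit_error = abs(image_eps(eps_s, g_small) - (eps_s + g_small))
```

In the weak-lensing example the deviation from $\epsilon^{(s)}+g$ is of order $|g|\,|\epsilon^{(s)}|^2$, which is why the linear approximation requires moderate intrinsic ellipticities.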



4.3 Local Determination of the Distortion



As mentioned earlier, the observed ellipticity of a single galaxy image provides
only little information about the local tidal gravitational field of the deflector, for
the intrinsic ellipticity of the source is unknown. However, based on the assumption
that the sources are randomly oriented, information on the local tidal field can be
inferred from a local ensemble of images. Consider for example galaxy images at
positions $\vec\theta_i$ close enough to a fiducial point $\vec\theta$ so that the local lens properties $\kappa$
and $\gamma$ do not change appreciably over the region encompassing these galaxies. The
expectation value of their corresponding source ellipticities is assumed to vanish,

$$E\left(\chi^{(s)}\right) = 0 = E\left(\epsilon^{(s)}\right) \;. \tag{4.13}$$

4.3.1 All Sources at the Same Redshift



We first consider the case that all sources are at the same redshift. Then, as mentioned
following eq. (3.13, page 49), the ellipticity of a circular source determines
the ratio of the local eigenvalues of the Jacobian matrix $\mathcal{A}$. This also holds for the
net image ellipticity of an ensemble of sources with vanishing net ellipticity. From
(3.11, page 49), we find for the ratio of the eigenvalues of $\mathcal{A}$ in terms of the reduced
shear $g$

$$r = \frac{1\mp|g|}{1\pm|g|} \;. \tag{4.14}$$

Interestingly, if we replace $g$ by $1/g^*$, $r$ switches sign, but $|r|$ and the phase of $\epsilon$
remain unchanged. The sign of $r$ cannot be determined observationally, and hence
measurements cannot distinguish between $g$ and $1/g^*$. This is called local degeneracy.
Writing $\det\mathcal{A} = (1-\kappa)^2(1-|g|^2)$, we see that the degeneracy between $g$
and $1/g^*$ means that we cannot distinguish between observed images inside a critical
curve (so that $\det\mathcal{A} < 0$ and $|g| > 1$) or outside. Therefore, only functions of $g$
which are invariant under $g \to 1/g^*$ are accessible to (local) measurements, as for
instance the complex distortion

$$\delta \equiv \frac{2g}{1+|g|^2} \;. \tag{4.15}$$

Replacing the expectation value in (4.13) by the average over a local ensemble
of image ellipticities, $\langle\chi^{(s)}\rangle \approx E(\chi^{(s)}) = 0$, Schneider & Seitz (1995) showed that
$\langle\chi^{(s)}\rangle = 0$ is equivalent to

$$\sum_i u_i\,\frac{\chi_i-\delta}{1-\Re(\delta\chi_i^*)} = 0 \;, \tag{4.16}$$

where the $u_i$ are weight factors depending on $|\vec\theta_i-\vec\theta|$ which can give larger weight
to galaxies closer to the fiducial point. Additionally, the $u_i$ can be chosen such
as to account for measurement uncertainties in the image ellipticities by giving
less weight to images with larger measurement error. Equation (4.16) has a unique
solution $\delta$, so that the distortion can locally be determined. It is readily solved by a
quickly converging iteration starting from $\delta = \langle\chi\rangle$.
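One possible realisation of such an iteration is a fixed-point scheme, sketched below. It assumes the condition to be solved is $\sum_i u_i\,(\chi_i-\delta)/[1-\Re(\delta\chi_i^*)] = 0$, with the distortion defined as $\delta = 2g/(1+|g|^2)$, and rewrites it as $\delta = \sum_i w_i\chi_i/\sum_i w_i$ with $w_i = u_i/[1-\Re(\delta\chi_i^*)]$. The specific update rule, the function names and the synthetic test ensemble (sources of fixed $|\chi^{(s)}|$ at evenly spaced phases, so that their mean vanishes exactly) are assumptions of this sketch, not necessarily the scheme used by the original authors.

```python
import cmath

def image_chi(chi_s, g):
    """Image ellipticity chi for a source ellipticity chi_s and reduced shear g."""
    num = chi_s + 2.0 * g + g * g * chi_s.conjugate()
    den = 1.0 + abs(g) ** 2 + 2.0 * (g * chi_s.conjugate()).real
    return num / den

def solve_distortion(chis, weights=None, tol=1e-14, max_iter=100):
    """Fixed-point iteration for the distortion delta, starting from the mean of chi."""
    if weights is None:
        weights = [1.0] * len(chis)
    delta = sum(u * c for u, c in zip(weights, chis)) / sum(weights)
    for _ in range(max_iter):
        w = [u / (1.0 - (delta * c.conjugate()).real) for u, c in zip(weights, chis)]
        new = sum(wi * c for wi, c in zip(w, chis)) / sum(w)
        if abs(new - delta) < tol:
            return new
        delta = new
    return delta

g_true = complex(0.1, 0.05)                                   # illustrative reduced shear
sources = [0.3 * cmath.exp(2j * cmath.pi * k / 8) for k in range(8)]
images = [image_chi(c, g_true) for c in sources]

delta_est = solve_distortion(images)
delta_true = 2.0 * g_true / (1.0 + abs(g_true) ** 2)
mean_chi = sum(images) / len(images)                          # starting value of the iteration
```

In this noise-free example the iteration recovers the input distortion, whereas the plain mean of the image ellipticities differs from it by a few per cent.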

The $\delta$ obtained from (4.16) is an unbiased estimate of the distortion. Its dispersion
about the true value depends on the dispersion $\sigma_\chi$ of the intrinsic ellipticity
distribution, and on the number of galaxy images. A fairly accurate estimate of the rms
error of $\delta$ is $\sigma_\delta \approx \sigma_\chi N^{-1/2}$, where $N$ is the effective
number of galaxies used for the local average, $N = (\sum u_i)^2 (\sum u_i^2)^{-1}$. This
overestimates the error for large values of $|\delta|$ (Schneider & Seitz 1995). It is
important to note that the expectation value of $\chi$ is not $\delta$, but differs from it by
a factor which depends both on $|\delta|$ and the intrinsic ellipticity distribution of the
sources. In contrast to that, it follows from (4.13) and (4.12) that the expectation value of
the complex ellipticity $\epsilon$ of the images is the reduced shear or its inverse,
$\mathrm{E}(\epsilon) = g$ if $|g| < 1$ and $\mathrm{E}(\epsilon) = 1/g^*$ if $|g| > 1$
(Schramm & Kayser 1995; Seitz & Schneider 1997). Hence,

$$ \langle\epsilon\rangle = \frac{\sum_i u_i \epsilon_i}{\sum_i u_i} \tag{4.17} $$

is an unbiased local estimate for $g$ or $1/g^*$. The ellipticity parameter $\epsilon$ is
useful exactly because of this property. If one deals with sub-critical lenses (i.e. lenses
which are not dense enough to have critical curves, so that $\det\mathcal{A}(\theta) > 0$
everywhere), or with the region outside the critical curves in critical lenses, the degeneracy
between $g$ and $1/g^*$ does not occur, and $\langle\epsilon\rangle$ is a convenient estimate
for the local reduced shear. The rms error of this estimate is approximately
$\sigma_\epsilon (1 - |g|^2) N^{-1/2}$ (Schneider et al. 1999), where $\sigma_\epsilon$ is the
dispersion of the intrinsic source ellipticity $\epsilon^{(s)}$. As we shall see in a moment,
$\epsilon$ is the more convenient ellipticity parameter when the sources are distributed in
redshift.
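The estimate (4.17) and its approximate rms error can be evaluated as in the following sketch. It assumes that the effective number $N = (\sum u_i)^2/\sum u_i^2$ quoted for the distortion estimate is also the appropriate $N$ here; the function names are illustrative only.

```python
import numpy as np

def mean_ellipticity(eps, u=None):
    """Weighted mean <eps> of complex image ellipticities, eq. (4.17)."""
    eps = np.asarray(eps, dtype=complex)
    u = np.ones(eps.shape) if u is None else np.asarray(u, dtype=float)
    return np.sum(u * eps) / np.sum(u)

def effective_number(u):
    """Effective number of galaxies, N = (sum u_i)^2 / sum u_i^2."""
    u = np.asarray(u, dtype=float)
    return np.sum(u) ** 2 / np.sum(u ** 2)

def rms_error(g_est, sigma_eps, n_eff):
    """Approximate rms error sigma_eps (1 - |g|^2) / sqrt(N) of the estimate."""
    return sigma_eps * (1.0 - abs(g_est) ** 2) / np.sqrt(n_eff)
```

For equal weights the effective number reduces to the number of galaxies, and for $|g| \ll 1$ the error approaches $\sigma_\epsilon/\sqrt{N}$.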

The estimates for $\delta$ and $g$ discussed above can be derived without knowing the
intrinsic ellipticity distribution. If, however, the intrinsic ellipticity distribution is
known (e.g. from deep Hubble Space Telescope images), we can exploit this additional
information and determine $\delta$ (or $g$) through a maximum-likelihood method
(Gould 1995; Lombardi & Bertin 1998a). Depending on the shape of the intrinsic
ellipticity distribution, this approach can yield estimates of the distortion which
have a smaller rms error than the estimates discussed above. However, if the intrinsic
ellipticity distribution is approximately Gaussian, the rms errors of both methods
are identical. It should be noted that the intrinsic ellipticity distribution is likely
to depend on the apparent magnitude of the galaxies, possibly on their redshifts, and
on the wavelength at which they are observed, so that this distribution is not easily
determined observationally. Knowledge of the intrinsic ellipticity distribution can
also be used to determine $\delta$ from the orientation of the images (that is, the phase of
$\chi$) only (Kochanek 1990; Schneider & Seitz 1995; Deiser 1995, unpublished). This
may provide a useful alternative to the method above since the orientation of images
is much less affected by seeing than the modulus of $\chi$. We return to the practical
estimate of the image ellipticities and the corresponding distortion in Sect. 4.6.

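In the case of weak lensing, defined by $\kappa \ll 1$ and $|\gamma| \ll 1$, implying
$|g| \ll 1$, we find from (4.11–4.16) that

$$ \gamma \approx g \approx \frac{\delta}{2} \approx \langle\epsilon\rangle \approx \frac{\langle\chi\rangle}{2} . \tag{4.18} $$
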
4.3.2 Sources Distributed in Redshift 

So far, we assumed that all source galaxies are at the same redshift, or more precisely,
that the ratio $D_{\rm ds}/D_{\rm s}$ between the lens-source and observer-source distances
is the same for all sources. This ratio enters into the scaling (3.7, page 48) of the
physical surface mass density $\Sigma$ to the dimension-less convergence $\kappa$. The
deflection angle, the deflection potential, and the shear are all linear in $\kappa$, so that
the distance ratio $D_{\rm ds}/D_{\rm s}$ is sufficient to specify the lens strength as a
function of source redshift. Provided $z_{\rm d} \lesssim 0.2$, this ratio is fairly constant
for sources with redshift $z_{\rm s} \gtrsim 0.8$, so that the approximation used so far applies
to relatively low-redshift deflectors. However, for higher-redshift lenses, the redshift
distribution of the sources must explicitly be taken into account.

For a fixed lens redshift $z_{\rm d}$, the dimension-less surface mass density and the shear
depend on the source redshift. We define

$$ Z(z) \equiv \frac{\lim_{z'\to\infty}\Sigma_{\rm cr}(z_{\rm d},z')}{\Sigma_{\rm cr}(z_{\rm d},z)}\,
   \mathrm{H}(z - z_{\rm d})
   = \frac{f_K[w(z_{\rm d},z)]}{f_K[w(0,z)]}\,
     \frac{f_K[w(0,\infty)]}{f_K[w(z_{\rm d},\infty)]}\,\mathrm{H}(z - z_{\rm d}) , \tag{4.19} $$

using the notation of Sect. 2.1 (page 12). The Heaviside step function accounts
for the fact that sources closer than the deflector are not lensed. Then,
$\kappa(\theta,z) = Z(z)\kappa(\theta)$ and $\gamma(\theta,z) = Z(z)\gamma(\theta)$ for a
source at $z$, where $\kappa$ and $\gamma$ refer to a fictitious source at redshift infinity.
The function $Z(z)$ is readily evaluated for any cosmological model using (2.41, page 22) and
(2.4, page 14). We plot $Z(z)$ for various cosmologies and lens redshifts in Fig. 12.
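As a concrete case, $Z(z)$ has a closed form in an Einstein-de Sitter universe. The sketch below assumes the standard Einstein-de Sitter comoving distance, $w(z_1,z_2) \propto (1+z_1)^{-1/2} - (1+z_2)^{-1/2}$ with $f_K(w) = w$, which gives $Z(z) = [1 - \sqrt{(1+z_{\rm d})/(1+z)}]/[1 - (1+z)^{-1/2}]$ for $z > z_{\rm d}$. The function name is hypothetical.

```python
import numpy as np

def lens_strength_eds(z, z_d):
    """Relative lens strength Z(z) for an Einstein-de Sitter cosmology (assumed)."""
    z = np.asarray(z, dtype=float)
    behind = z > z_d                       # Heaviside factor H(z - z_d)
    # use a harmless dummy redshift for foreground sources to avoid 0/0
    z_safe = np.where(behind, z, z_d + 1.0)
    num = 1.0 - np.sqrt((1.0 + z_d) / (1.0 + z_safe))
    den = 1.0 - 1.0 / np.sqrt(1.0 + z_safe)
    return np.where(behind, num / den, 0.0)
```

The function vanishes for $z \le z_{\rm d}$, rises monotonically behind the lens, and approaches unity for $z \to \infty$, as required by the definition (4.19).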

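The expectation value for the ellipticity of images with redshift $z$ now becomes

$$ \mathrm{E}[\epsilon(z)] = \begin{cases}
   \dfrac{Z(z)\gamma}{1 - Z(z)\kappa} & \text{for } \mu(z) \ge 0 \\[2ex]
   \dfrac{1 - Z(z)\kappa}{Z(z)\gamma^*} & \text{for } \mu(z) < 0
   \end{cases} \tag{4.20} $$

where $\mu(z)$ is the magnification as a function of source redshift,

$$ \mu(z) = \left\{[1 - Z(z)\kappa]^2 - Z^2(z)|\gamma|^2\right\}^{-1} . \tag{4.21} $$

We refer to sub-critical lensing if $\mu(z) > 0$ for all redshifts, which is equivalent to
$1 - \kappa - |\gamma| > 0$.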
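The case distinction in (4.20) is easy to get wrong in practice, so a small sketch may help. It evaluates the sign of $\mu(z)$ through the determinant $[1-Z\kappa]^2 - Z^2|\gamma|^2$ rather than through $\mu$ itself, which is a choice made here to avoid dividing by zero on a critical curve; the function names are illustrative.

```python
def jacobian_determinant(Z, kappa, gamma):
    """1/mu(z) from eq. (4.21); its sign equals the sign of mu(z)."""
    return (1.0 - Z * kappa) ** 2 - Z ** 2 * abs(gamma) ** 2

def expected_ellipticity(Z, kappa, gamma):
    """Expectation value E[eps(z)] of eq. (4.20) for a source with lens strength Z."""
    gamma = complex(gamma)
    if jacobian_determinant(Z, kappa, gamma) >= 0.0:
        return Z * gamma / (1.0 - Z * kappa)              # mu(z) >= 0
    return (1.0 - Z * kappa) / (Z * gamma.conjugate())    # mu(z) < 0

def is_subcritical(kappa, gamma):
    """True if mu(z) > 0 for all source redshifts, i.e. 1 - kappa - |gamma| > 0."""
    return 1.0 - kappa - abs(gamma) > 0.0
```

For $Z = 1$ the two branches reduce to $g$ and $1/g^*$, the single-redshift result quoted before (4.17).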

Fig. 12. The function $Z(z)$ defined in eq. (4.19) describes the relative lens strength as a
function of source redshift $z$. We show $Z(z)$ for three cosmological models as indicated
in the figure, and for three values for the lens redshift, $z_{\rm d} = 0.2, 0.5, 0.8$. By
definition, $Z(z) \to 0$ as $z \to z_{\rm d}$, and $Z(z) \to 1$ as $z \to \infty$. For sources
close to the deflector, $Z(z)$ varies strongly in a way depending relatively weakly on cosmology.

Without redshift information, only the mean ellipticity averaged over all redshifts
can be observed. We first consider this case, for which the source redshift distribution
is assumed to be known. We define the probability $p_z(z)\,dz$ that a galaxy image
(in the selected magnitude range) has a redshift within $dz$ of $z$. The image redshift
distribution will in general be different from the source redshift distribution since
magnified sources can be seen to higher redshifts than unlensed ones. Therefore,
the redshift distribution will depend on the local lens parameters $\kappa$ and $\gamma$
through the magnification (4.21). If, however, the magnification is small, or if the redshift
distribution depends only weakly on the flux, the simplification of identifying the two
redshift distributions is justified. We shall drop it later. Given $p_z(z)$, the expectation
value of the image ellipticity becomes the weighted average

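$$ \mathrm{E}(\epsilon) = \int dz\, p_z(z)\, \mathrm{E}[\epsilon(z)]
   = \gamma\left[X(\kappa,\gamma) + |\gamma|^{-2}\, Y(\kappa,\gamma)\right] , \tag{4.22} $$

with

$$ X(\kappa,\gamma) = \int_{\mu(z)\ge 0} dz\, p_z(z)\, \frac{Z(z)}{1 - Z(z)\kappa} , \qquad
   Y(\kappa,\gamma) = \int_{\mu(z)<0} dz\, p_z(z)\, \frac{1 - Z(z)\kappa}{Z(z)} , \tag{4.23} $$

and the integration boundaries depend on the values of $\kappa$ and $|\gamma|$ through the
magnification.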

If the lens is sub-critical, $\mu(z) > 0$ for all $z$. Then $Y = 0$, and only the first term in
(4.22) remains. Also, $X$ no longer depends on $\gamma$ in this case, and
$\mathrm{E}(\epsilon) = \gamma X(\kappa)$.
An accurate approximation for $X(\kappa)$, valid for $\kappa \lesssim 0.6$, has been derived in
Seitz & Schneider (1997),

$$ X(\kappa) \approx \frac{\langle Z\rangle}{1 - \dfrac{\langle Z^2\rangle}{\langle Z\rangle}\,\kappa} , \tag{4.24} $$

where $\langle Z^n\rangle \equiv \int dz\, p_z(z)\, Z^n$.


Specialising further to the weak-lensing regime, the expectation value of the image
ellipticity is simply

$$ \mathrm{E}(\epsilon) \approx \langle Z\rangle\,\gamma . \tag{4.25} $$

Thus, in the weak-lensing case, a source redshift distribution can be collapsed on a
single redshift $z_{\rm s}$ satisfying $Z(z_{\rm s}) = \langle Z\rangle$.
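The quality of the approximation (4.24) can be checked numerically. The sketch below is illustrative only: it assumes an Einstein-de Sitter $Z(z)$ and a source redshift distribution of the form $p_z(z) \propto z^2 \exp[-(z/z_0)^\beta]$, normalised numerically; the parameter values and function names are hypothetical.

```python
import numpy as np

def lens_strength_eds(z, z_d):
    """Z(z) for an Einstein-de Sitter universe (assumed cosmology)."""
    behind = z > z_d
    z_safe = np.where(behind, z, z_d + 1.0)
    num = 1.0 - np.sqrt((1.0 + z_d) / (1.0 + z_safe))
    den = 1.0 - 1.0 / np.sqrt(1.0 + z_safe)
    return np.where(behind, num / den, 0.0)

def trapezoid(y, x):
    """Plain trapezoidal rule, kept explicit to stay independent of NumPy versions."""
    return float(np.sum(0.5 * (y[1:] + y[:-1]) * np.diff(x)))

def x_exact_and_approx(z_d, kappa, z0=0.7, beta=1.5):
    """Return (<Z>, X from the sub-critical integral in (4.23), X from (4.24))."""
    z = np.linspace(1e-4, 10.0, 20001)
    p = z ** 2 * np.exp(-(z / z0) ** beta)   # assumed shape of p_z(z)
    p /= trapezoid(p, z)                     # normalise to unit integral
    Z = lens_strength_eds(z, z_d)
    mean_Z = trapezoid(p * Z, z)
    mean_Z2 = trapezoid(p * Z ** 2, z)
    x_exact = trapezoid(p * Z / (1.0 - Z * kappa), z)
    x_approx = mean_Z / (1.0 - kappa * mean_Z2 / mean_Z)
    return mean_Z, x_exact, x_approx
```

Expanding both expressions in powers of $\kappa$ shows that they agree to first order, so the approximation is expected to degrade only slowly as $\kappa$ grows.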

We now drop the simplification introduced above and define $n_0(S,z)\,dS\,dz$ as the
number of galaxy images per unit solid angle with flux within $dS$ of $S$ and redshift
within $dz$ of $z$ in the absence of lensing. At a point $\theta$ with surface mass density
$\kappa$ and shear $\gamma$, the number density can be changed by magnification. Images of a
fixed set of sources are distributed over a larger solid angle, reducing the number density
by a factor $\mu^{-1}(z)$. On the other hand, the magnification allows the observation of
fainter sources. In total, the expected number density becomes

$$ n(S,z) = \frac{1}{\mu^2(z)}\, n_0\!\left(\frac{S}{\mu(z)}, z\right) , \tag{4.26} $$

with $\mu(z)$ given in (4.21). This yields the redshift distribution

$$ p(z; S, \kappa, \gamma) = \frac{n_0\left[\mu^{-1}(z)S, z\right]}
   {\mu^2(z) \int dz'\, \mu^{-2}(z')\, n_0\left[\mu^{-1}(z')S, z'\right]} , \tag{4.27} $$

which depends on the flux $S$ and the local lens parameters $\kappa$ and $\gamma$ through the
magnification. This function can now be substituted for $p_z(z)$ in eq. (4.22).



4.3.3 Practical Estimates of the Shear 

We saw before that $\langle\epsilon\rangle = \sum_i u_i\epsilon_i / \sum_i u_i$ is an unbiased
estimate of the local reduced shear $g$ if all sources are at the same redshift. We now
generalise this result for sources distributed in redshift. Then, the expectation value of
$\epsilon$ is no longer a simple function of $\kappa$ and $\gamma$, and therefore estimates of
$\gamma$ for an assumed value for $\kappa$ will be derived.

We first assume that redshifts for individual galaxies are unavailable, but that only
the normalised redshift distribution $p_z(z)$ is known, or the distribution in eq. (4.27).
Replacing the expectation value of the image ellipticity by the mean, eq. (4.22)
implies that the solution $\gamma^{(1)}$ of

$$ \gamma = \left[X(\kappa,\gamma) + |\gamma|^{-2}\, Y(\kappa,\gamma)\right]^{-1} \langle\epsilon\rangle \tag{4.28} $$

provides an unbiased estimator for the shear $\gamma$. This is not a particularly explicit
expression for the shear estimate, but it is still extremely useful, as we shall see in
the next section. The shear estimate considerably simplifies if we assume a sub-critical
lens. Then,

$$ \gamma^{(1,{\rm sc})} = \langle\epsilon\rangle\, X^{-1}(\kappa) \approx
   \langle\epsilon\rangle\, \frac{1 - \dfrac{\langle Z^2\rangle}{\langle Z\rangle}\,\kappa}{\langle Z\rangle} , \tag{4.29} $$

where we used eq. (4.24) in the second step. Specialising further to weak lensing,
the shear estimate simplifies to

$$ \gamma^{(1,{\rm wl})} = \langle\epsilon\rangle\, \langle Z\rangle^{-1} . \tag{4.30} $$



Next, we assume that the redshifts of all galaxy images are known. At first sight, 
this appears entirely unrealistic, because the galaxy images are so faint that a com- 
plete spectroscopic survey at the interesting magnitude limits seems to be out of 
reach. However, it has become clear in recent years that accurate redshift estimates, 
the so-called photometric redshifts, can be obtained from multi-colour photome- 
try alone (see, e.g., Connolly et al. 1995). The accuracy of photometric redshifts 
depends on the number of wave bands for which photometry is available, the photometric
accuracy, and the galaxy type; typical errors are $\Delta z \sim 0.1$ for faint, high-redshift
galaxies. This uncertainty is small compared to the range over which the
function Z(z) varies appreciably, so that photometric redshifts are (almost) as good 
as precise spectroscopic redshifts for our purposes. 

If the redshifts $z_i$ of the galaxies are known, more precise shear estimates than
before can be derived. Consider the weighted sum
$F = \sum_i u_i\, |\epsilon_i - \mathrm{E}(\epsilon_i)|^2$, where the expectation value is given
by eq. (4.20), and $Z = Z_i \equiv Z(z_i)$. For an assumed value of $\kappa$, an unbiased
estimate of $\gamma$ is given by the $\gamma^{(2)}$ minimising $F$. Due to the case
distinction in eq. (4.20), this estimator is complicated to write down analytically,
but can easily be calculated numerically.

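This case distinction is no longer necessary in the sub-critical case, for which the
resulting estimator reads

$$ \gamma^{(2,{\rm sc})} = \frac{\sum_i u_i Z_i \epsilon_i (1 - Z_i\kappa)^{-1}}
   {\sum_i u_i Z_i^2 (1 - Z_i\kappa)^{-2}} . \tag{4.31} $$

In the case of weak lensing, this becomes

$$ \gamma^{(2,{\rm wl})} = \frac{\sum_i u_i Z_i \epsilon_i}{\sum_i u_i Z_i^2} . \tag{4.32} $$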
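The estimators (4.30), (4.31) and (4.32) translate directly into code. The following is a minimal sketch with hypothetical function names; `Z` is assumed to hold the values $Z_i = Z(z_i)$ of the individual galaxies, and `eps` their complex ellipticities.

```python
import numpy as np

def shear_without_redshifts_wl(eps, mean_Z, u=None):
    """Weak-lensing estimate (4.30): <eps> / <Z>, with <Z> from the known p_z(z)."""
    eps = np.asarray(eps, dtype=complex)
    u = np.ones(eps.shape) if u is None else np.asarray(u, dtype=float)
    return np.sum(u * eps) / np.sum(u) / mean_Z

def shear_with_redshifts_sc(eps, Z, kappa, u=None):
    """Sub-critical estimate (4.31) for an assumed value of kappa."""
    eps = np.asarray(eps, dtype=complex)
    Z = np.asarray(Z, dtype=float)
    u = np.ones(eps.shape) if u is None else np.asarray(u, dtype=float)
    a = Z / (1.0 - Z * kappa)          # redshift-dependent response of each galaxy
    return np.sum(u * a * eps) / np.sum(u * a ** 2)

def shear_with_redshifts_wl(eps, Z, u=None):
    """Weak-lensing limit (4.32), i.e. (4.31) with kappa -> 0."""
    return shear_with_redshifts_sc(eps, Z, 0.0, u)
```

Galaxies in front of the lens have $Z_i = 0$ and therefore drop out of both sums in (4.31), which is one way in which individual redshifts help.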
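We now compare the accuracy of the shear estimates with and without redshift information
of the individual galaxies. For simplicity, we assume sub-critical lensing and set all
weight factors to unity, $u_i = 1$. The dispersion of the estimate
$\gamma^{(1,{\rm sc})} = (NX)^{-1} \sum_i \epsilon_i$ for $N$ galaxy images is

$$ \sigma^2\!\left(\gamma^{(1,{\rm sc})}\right) = \mathrm{E}\!\left(|\gamma^{(1,{\rm sc})}|^2\right) - |\gamma|^2
   = [N X(\kappa)]^{-2}\, \mathrm{E}\!\left(\sum_{i,j=1}^{N} \epsilon_i \epsilon_j^*\right) - |\gamma|^2 . \tag{4.33} $$

The expectation value in the final expression can be estimated noting that the image
ellipticity is to first order given by $\epsilon_i = \epsilon_i^{(s)} + \gamma$ (with $\gamma$
scaled to the redshift of the $i$-th source, i.e. multiplied by $Z_i/(1 - Z_i\kappa)$), and that
the intrinsic ellipticities are uncorrelated. If we further assume that the redshifts of any
two galaxies are uncorrelated, we find

$$ \mathrm{E}(\epsilon_i \epsilon_j^*) = X^2(\kappa)\,|\gamma|^2
   + \delta_{ij}\left(\sigma_X^2 |\gamma|^2 + \sigma_\epsilon^2\right) , \tag{4.34} $$

where we used the definition (4.23) of $X(\kappa)$, and defined
$\sigma_X^2(\kappa) \equiv \langle Z^2 (1 - Z\kappa)^{-2}\rangle - X^2$. Angular brackets denote
averages over the redshift distribution $p_z$. Inserting (4.34) into (4.33) yields

$$ \sigma^2\!\left(\gamma^{(1,{\rm sc})}\right) = \frac{\sigma_X^2 |\gamma|^2 + \sigma_\epsilon^2}{N X^2} . \tag{4.35} $$
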
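Likewise, the dispersion of the estimate $\gamma^{(2,{\rm sc})}$ is

$$ \sigma^2\!\left(\gamma^{(2,{\rm sc})}\right)
   = \frac{\sum_{ij} Z_i Z_j (1 - Z_i\kappa)^{-1} (1 - Z_j\kappa)^{-1}\, \mathrm{E}(\epsilon_i \epsilon_j^*)}
          {\left[\sum_i Z_i^2 (1 - Z_i\kappa)^{-2}\right]^2} - |\gamma|^2
   = \frac{\sigma_\epsilon^2}{\sum_i Z_i^2 (1 - Z_i\kappa)^{-2}}
   = \frac{\sigma_\epsilon^2}{N\left[X^2 + \sigma_X^2\right]} . \tag{4.36} $$

We used eq. (4.34), but noted that $Z$ is now no longer a statistical variable, so that
we can put $\sigma_X = 0$ in (4.34). In the final step, we have replaced the denominator
by its expectation value under ensemble averaging. We then find the ratio of the
dispersions,

$$ \frac{\sigma^2\!\left(\gamma^{(1,{\rm sc})}\right)}{\sigma^2\!\left(\gamma^{(2,{\rm sc})}\right)}
   = \left(1 + |\gamma|^2 \frac{\sigma_X^2}{\sigma_\epsilon^2}\right)
     \left(1 + \frac{\sigma_X^2}{X^2}\right) . \tag{4.37} $$
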

We thus see that the relative accuracy of these two estimates depends on the fractional
width of the distribution of $Z/(1 - Z\kappa)$, and on the ratio between the dispersion
of this quantity and the ellipticity dispersion. Through its explicit dependence
on $|\gamma|^2$, and through the dependence of $\sigma_X$ and $X$ on $\kappa$, the relative
accuracy also depends on the lens parameters. Quantitative estimates of (4.37) are given in
Fig. 13.
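To get a feeling for the size of the ratio (4.37), one can evaluate it for a sample of lens strengths. This sketch assumes that `Z` is an array of values $Z(z_i)$ that fairly samples the redshift distribution, so that sample means can stand in for the averages $\langle\cdot\rangle$; the function name is hypothetical.

```python
import numpy as np

def dispersion_ratio(Z, kappa, gamma_abs, sigma_eps):
    """Ratio sigma^2(gamma^(1,sc)) / sigma^2(gamma^(2,sc)) of eq. (4.37)."""
    Z = np.asarray(Z, dtype=float)
    a = Z / (1.0 - Z * kappa)
    X = a.mean()                                  # X(kappa) for a sub-critical lens
    sigma_X2 = (a ** 2).mean() - X ** 2           # sigma_X^2 as defined after (4.34)
    return (1.0 + gamma_abs ** 2 * sigma_X2 / sigma_eps ** 2) * (1.0 + sigma_X2 / X ** 2)
```

The ratio equals unity when all sources share the same $Z$ and exceeds unity otherwise, so redshift information never degrades the estimate under these assumptions.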




Fig. 13. The fractional accuracy gain in the shear estimate due to the knowledge of
the source redshifts is plotted, more precisely the deviation of the square root of (4.37)
from unity in per cent. The four curves shown correspond to two different values of the
mean source redshift, and to the cases without lensing ($\kappa = 0 = \gamma$), and with lensing
($\kappa = 0.3 = |\gamma|$), labelled NL and L, respectively. We assumed the redshift
distribution (2.69) with $\beta = 3/2$, and an Einstein-de Sitter cosmology. As expected, the
higher the lens redshift $z_{\rm d}$, the more substantially is the shear estimate improved by
redshift information, since for low values of $z_{\rm d}$, the function $Z(z)$ is nearly
constant. Furthermore, the lower the mean redshift of the source distribution, the more
important the knowledge of individual redshifts becomes, for example to distinguish between
foreground and background galaxies. Finally, redshift information is relatively more important
for larger lens strength.

The figure shows that the accuracy of the shear estimate is noticeably improved, in
particular once the lens redshift becomes a fair fraction of the mean source redshift.
The dependence of the lens strength on the deflector redshift implies that the lens
signal will become smaller for increasing deflector redshift, so that the accuracy
gained by redshift information becomes significant. In addition, the assumptions
used to derive (4.35) were quite optimistic, since we have assumed in (4.34) that
the sample of galaxies over which the average is taken is a fair representation of
the galaxy redshift distribution $p_z(z)$. Given that these galaxies come from a small
area (small enough to assume that $\kappa$ and $\gamma$ are constant across this area), and that
the redshift distribution of observed galaxies in pencil beams shows strong correlations
(see, e.g., Broadhurst et al. 1990, Steidel et al. 1998, Cohen et al. 1999), this
assumption is not very realistic. Indeed, the strong clustering of galaxy redshifts
means that the effective $\sigma_X$ will be considerably larger than the analytical estimate
used above. In any case, redshift information on the source galaxies will substantially
improve the accuracy of weak lensing results.

4.4 Magnification Effects 

In addition to the distortion of image shapes, by which the (reduced) shear can be 
measured locally, gravitational light deflection also magnifies the images, leaving 
the surface brightness invariant. The magnification changes the size, and therefore 
the flux, of individual galaxy images. Moreover, for a fixed set of sources, the number
density of images decreases by a factor $\mu$ as the sky is locally stretched. Combining
the latter effect with the flux magnification, the lensed source counts are related to the
unlensed ones according to (4.26). Two strategies to measure the magnification
effect have been suggested in the literature, namely either through the change
in the local source counts, perhaps combined with the associated change (4.27) in 
the redshift distribution (Broadhurst et al. 1995), or through the change of image 
sizes at fixed surface brightness (Bartelmann & Narayan 1995). 

4.4.1 Number Density Effect

Let $n_0(>S,z)\,dz$ be the unlensed number density of galaxies with redshift within
$dz$ of $z$ and with flux larger than $S$. Then, at an angular position $\theta$ where the
magnification is $\mu(\theta,z)$, the number counts are changed according to (4.26),

$$ n(>S,z) = \frac{1}{\mu(\theta,z)}\, n_0\!\left(>\frac{S}{\mu(\theta,z)}, z\right) . \tag{4.38} $$

Accordingly, magnification can either increase or decrease the local number counts,
depending on the shape of the unlensed number-count function. This change of
number counts is called magnification bias, and is a very important effect for
gravitational lensing of QSOs (see Schneider et al. 1992 for references).^1

^1 Bright QSOs have a very steep number-count function, and so the flux enhancement
of the sources outweighs the number reduction due to the stretching of the sky by a large
margin. Whereas the lensing probability even for a high-redshift QSO is probably too small
to affect the overall source counts significantly, the fraction of multiply-imaged QSOs in
flux-limited samples is increased through the magnification bias by a substantial factor over
the probability that any individual QSO is multiply imaged (see, e.g. Turner et al. 1984;
Narayan & Wallington 1993 and references therein).

Magnification allows the observation of fainter sources. Since the flux from the
sources is correlated with their redshift, the redshift distribution is changed accordingly,

$$ p(z; >S, \kappa, \gamma) = \frac{n_0\left[>\mu^{-1}(z)S, z\right]}
   {\mu(z) \int dz'\, \mu^{-1}(z')\, n_0\left[>\mu^{-1}(z')S, z'\right]} , \tag{4.39} $$

in analogy to the redshift distribution (4.27) at fixed flux $S$. Since the objects of
interest here are very faint, spectroscopic redshift information is in general difficult
to obtain, and so one can only observe the redshift-integrated counts

$$ n(>S) = \int dz\, \frac{1}{\mu(z)}\, n_0\!\left(>\mu^{-1}(z)S, z\right) . \tag{4.40} $$

The number counts of faint galaxies are observed to very closely follow a power
law over a wide range of fluxes, and so we write the unlensed counts as

$$ n_0(>S,z) = a\, S^{-\alpha}\, p_0(z;S) , \tag{4.41} $$

where the exponent $\alpha$ depends on the wave band of the observation
(e.g. Small et al. 1995a), and $p_0(z;S)$ is the redshift probability distribution of
galaxies with flux $> S$. Whereas this redshift distribution is fairly well known for
brighter galaxies which are accessible to current spectroscopy, little is known about
the faint galaxies of interest here. The ratio of the lensed and unlensed source counts
is then found by inserting (4.41) into (4.40),

$$ \frac{n(>S)}{n_0(>S)} = \int dz\, \mu^{\alpha-1}(z)\, p_0\!\left(z; \mu^{-1}(z)S\right) . \tag{4.42} $$

We should note that the lensed counts do not strictly follow a power law in $S$, for
$p_0$ depends on $z$. Since the redshift distribution $p_0(z;S)$ is currently unknown, the
change of the number counts due to the magnification cannot be predicted. For very
faint flux thresholds, however, the redshift distribution is likely to be dominated by
galaxies at relatively high redshift. For lenses at fairly small redshift (say
$z_{\rm d} \lesssim 0.3$), we can approximate the redshift-dependent magnification $\mu(z)$ by the
magnification $\mu$ of a fiducial source at infinity, in which case

$$ \frac{n(>S)}{n_0(>S)} = \mu^{\alpha-1} . \tag{4.43} $$



Thus, a local estimate of the magnification can be obtained through (4.43) and from
a measurement of the local change of the number density of images. If the slope of
the source counts is unity, $\alpha = 1$, there will be no magnification bias, while it will
cause a decrease of the local number density for flatter slopes. Broadhurst (1995)
pointed out that one can immediately obtain (for sub-critical lensing, i.e.
$\det\mathcal{A} > 0$) an estimate for the local surface mass density from a measurement of the
local magnification and the local reduced shear $g$,
$\kappa = 1 - [\mu(1 - |g|^2)]^{-1/2}$. In the absence of shape information, (4.43) can be used
in the weak lensing limit [where $\kappa \ll 1$, $|\gamma| \ll 1$, so that
$\mu \approx (1 + 2\kappa)$] to obtain an estimate of the surface mass density,

$$ \kappa \approx \frac{n(>S) - n_0(>S)}{n_0(>S)}\, \frac{1}{2(\alpha - 1)} . \tag{4.44} $$
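The relations (4.43) and (4.44), and the expression for $\kappa$ in terms of $\mu$ and $|g|$, are simple enough to check numerically. The following sketch uses hypothetical function names and assumes a single power-law slope $\alpha$ for the unlensed counts.

```python
def count_ratio(mu, alpha):
    """Lensed-to-unlensed cumulative count ratio, eq. (4.43)."""
    return mu ** (alpha - 1.0)

def kappa_from_counts(n_obs, n_0, alpha):
    """Weak-lensing estimate of kappa from number counts, eq. (4.44); needs alpha != 1."""
    return (n_obs - n_0) / n_0 / (2.0 * (alpha - 1.0))

def kappa_from_mu_and_g(mu, g_abs):
    """Sub-critical kappa from the magnification and reduced shear: 1 - [mu (1 - |g|^2)]^(-1/2)."""
    return 1.0 - (mu * (1.0 - g_abs ** 2)) ** -0.5
```

For slopes flatter than unity the lensed counts are depleted ($\mu > 1$ gives a ratio below one), which is the sign convention implicit in (4.44).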



4.4.2 Size Effect 

Since lensing conserves surface brightness, the magnification can be obtained from
the change in galaxy-image sizes at fixed surface brightness. Let $I$ be some convenient
measure of the surface brightness. For example, if $\omega$ is the solid angle of an
image, defined by the determinant of the tensor of second brightness moments as
in (4.3), one can set $I = S/\omega$.

Denoting by $n(\omega,I,z)\,d\omega$ the number density of images with surface brightness $I$,
redshift $z$, and solid angle within $d\omega$ of $\omega$, the relation between the lensed and the
unlensed number density can be written

$$ n(\omega,I,z) = \frac{1}{\mu^2}\, n_0\!\left(\frac{\omega}{\mu}, I, z\right) . \tag{4.45} $$

For simplicity, we only consider the case of a moderately small lens redshift, so that
the magnification can be assumed to be locally constant for all images, irrespective
of galaxy redshift. We can then drop the variable $z$ here. The mean image size
$\langle\omega\rangle(I)$ at fixed surface brightness $I$ is then related to the mean image size
$\langle\omega\rangle_0(I)$ in the absence of lensing through

$$ \langle\omega\rangle(I) = \mu\, \langle\omega\rangle_0(I) . \tag{4.46} $$

If the mean image size in the absence of lensing can be measured (e.g. by deep
HST exposures of blank fields), the local value $\mu$ of the magnification can therefore
be determined by comparing the observed image sizes to those in the blank
fields. This method has been discussed in detail in Bartelmann & Narayan (1995).
For instance, if we assume that the logarithm of the image size is distributed as a
Gaussian with mean $\langle\ln\omega\rangle_0(I)$ and dispersion $\sigma(I)$, we obtain an estimate
for the local magnification from a set of $N$ galaxy images,

$$ \ln\mu = \sum_{i=1}^{N} \frac{\ln\omega_i - \langle\ln\omega\rangle_0(I_i)}{\sigma^2(I_i)}
   \left(\sum_{i=1}^{N} \frac{1}{\sigma^2(I_i)}\right)^{-1} . \tag{4.47} $$

A typical value for the dispersion is $\sigma(I) \approx 0.5$ (Bartelmann & Narayan 1995).
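A size-based magnification estimate of this kind can be sketched as follows. The code assumes that $\ln\mu$ is estimated by the inverse-variance weighted mean of $\ln\omega_i - \langle\ln\omega\rangle_0(I_i)$, which is the maximum-likelihood choice for Gaussian-distributed $\ln\omega$; the function name and argument layout are hypothetical.

```python
import numpy as np

def magnification_from_sizes(omega, mean_ln_omega0, sigma):
    """Estimate mu from image solid angles at fixed surface brightness.

    omega          : observed solid angles omega_i of the images
    mean_ln_omega0 : unlensed mean <ln omega>_0(I_i), e.g. from blank fields
    sigma          : dispersion sigma(I_i) of ln omega for each image
    """
    omega = np.asarray(omega, dtype=float)
    mean_ln_omega0 = np.asarray(mean_ln_omega0, dtype=float)
    w = 1.0 / np.asarray(sigma, dtype=float) ** 2      # inverse-variance weights
    ln_mu = np.sum(w * (np.log(omega) - mean_ln_omega0)) / np.sum(w)
    return float(np.exp(ln_mu))
```

With equal dispersions $\sigma$ for all images, the statistical error of $\ln\mu$ is $\sigma/\sqrt{N}$.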



4.4.3 Relative Merits of Shear and Magnification Effect

It is interesting to compare the prospects of measuring shear and magnification
caused by a deflector. We consider a small patch of the sky containing an expected
number $N$ of galaxy images (in the absence of lensing), which is sufficiently small
so that the lens parameters $\kappa$ and $\gamma$ can be assumed to be constant. We also restrict
the discussion to the weak-lensing case.

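The dispersion of a shear estimate from averaging over galaxy ellipticities is
$\sigma_\epsilon/\sqrt{N}$, so that the signal-to-noise ratio is

$$ \left(\frac{\rm S}{\rm N}\right)_{\rm shear} = \frac{|\gamma|}{\sigma_\epsilon}\, \sqrt{N} . \tag{4.48} $$

According to (4.44), the expected change in galaxy number counts is
$|\Delta N| = 2\kappa\,|\alpha - 1|\,N$. Assuming Poissonian noise, the signal-to-noise ratio
in this case is

$$ \left(\frac{\rm S}{\rm N}\right)_{\rm counts} = 2\kappa\,|\alpha - 1|\, \sqrt{N} . \tag{4.49} $$

Finally, the signal-to-noise ratio for the magnification estimate (4.47) is

$$ \left(\frac{\rm S}{\rm N}\right)_{\rm size} = \frac{2\kappa}{\sigma(I)}\, \sqrt{N} , \tag{4.50} $$

assuming all $\sigma(I)$ are equal.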
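Comparing the three methods, we find

$$ \frac{({\rm S/N})_{\rm shear}}{({\rm S/N})_{\rm counts}} = \frac{|\gamma|}{\kappa}\,
   \frac{1}{2\sigma_\epsilon\,|\alpha - 1|} , \qquad
   \frac{({\rm S/N})_{\rm counts}}{({\rm S/N})_{\rm size}} = \sigma(I)\,|\alpha - 1| . \tag{4.51} $$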



If the lens situation is such that $\kappa \approx |\gamma|$ as for isothermal spheres, the first
of eqs. (4.51) implies that the signal-to-noise of the shear measurement is considerably
larger than that of the magnification. Even for number-count slopes as flat
as $\alpha \sim 0.5$, this ratio is larger than five, with $\sigma_\epsilon \sim 0.2$. The second of
eqs. (4.51) shows that the size effect yields a somewhat larger signal-to-noise ratio than the
number-density effect. We therefore conclude from these considerations that shear
measurements should yield more significant results than magnification measurements.
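The three signal-to-noise ratios (4.48) to (4.50) can be compared for any set of lens and survey parameters with a few lines of code. The function names below are illustrative, and the numbers in the usage note are example values rather than measurements.

```python
import math

def sn_shear(gamma_abs, sigma_eps, n_gal):
    """Signal-to-noise of the shear estimate, eq. (4.48)."""
    return gamma_abs / sigma_eps * math.sqrt(n_gal)

def sn_counts(kappa, alpha, n_gal):
    """Signal-to-noise of the number-count change (Poisson noise), eq. (4.49)."""
    return 2.0 * kappa * abs(alpha - 1.0) * math.sqrt(n_gal)

def sn_size(kappa, sigma_I, n_gal):
    """Signal-to-noise of the size-based magnification estimate, eq. (4.50)."""
    return 2.0 * kappa / sigma_I * math.sqrt(n_gal)
```

For example, with $\kappa = |\gamma| = 0.05$, $\alpha = 0.5$, $\sigma_\epsilon = 0.2$, $\sigma(I) = 0.5$ and $N = 100$, the shear method gives the largest ratio, followed by the size method and then the number counts.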

This, however, is not the end of the story. Several additional considerations come 
into play when these three methods of measuring lensing effects are compared. 
First, the shear measurement is the only one for which we know precisely what 
to expect in the absence of lensing, whereas the other two methods need to com- 
pare the measurements with calibration fields void of lensing. These comparisons 
require very accurate photometry. Second, eq. (4.49) overestimates the signal-to-noise
ratio since we assumed Poissonian errors, while real galaxies are known to
cluster even at very faint magnitudes (e.g., Villumsen et al. 1997), and so the error
is substantially underestimated. Third, as we shall discuss in Sect. 4.6, observational
effects such as atmospheric seeing affect the observable ellipticities and
sizes of galaxy images, whereas the observed flux of galaxies is much less affected.
Hence, the shear and size measurements require better seeing conditions than the 
number-count method. Both the number counts and the size measurements (at fixed 
surface brightness) require accurate photometry, which is not very important for the 
shear measurements. As we shall see in the course of this article, most weak-lensing 
measurements have indeed been obtained from galaxy ellipticities. 



4.5 Minimum Lens Strength for its Weak Lensing Detection 

After our detailed discussion of shear estimates and signal-to-noise ratios for local 
lensing measurements, it is interesting to ask how strong a deflecting mass distri- 
bution needs to be for a weak lensing measurement to recognise it. Our simplified 
consideration here suffices to gain insight into the dependence on the lens mass 
of the signal-to-noise ratio for a lens detection, and on the redshifts of lens and 
sources. 

We model the deflector as a singular isothermal sphere (see Sect. 3.1.5, page 50).
Let there be $N$ galaxy images with ellipticities $\epsilon_i$ in an annulus centred on the lens
and bounded by angular radii $\theta_{\rm in} \le \theta_i \le \theta_{\rm out}$. For simplicity, we
restrict ourselves to weak lensing, so that $\mathrm{E}(\epsilon) \approx \gamma$. For an
axially-symmetric mass distribution, the shear is always tangentially oriented relative to the
direction towards the mass centre, which is expressed by eq. (3.18) on page 51. We therefore
consider the ellipticity component projected onto the tangential direction. It is formally
defined by $\epsilon_{\rm t} \equiv -\Re(\epsilon\, e^{-2i\varphi})$, where $\varphi$ is the polar angle
of the galaxy position relative to the lens centre [see (3.18), page 51]. We now define an
estimator for the lens strength by

$$ X \equiv \sum_{i=1}^{N} a_i\, \epsilon_{{\rm t}i} . \tag{4.52} $$

The factors $a_i = a(\theta_i)$ are arbitrary at this point, and will be chosen later such as to
maximise the signal-to-noise ratio of the estimator (4.52). Note that the expectation
value of $X$ is zero in the absence of lensing, so that a significant non-zero value of
$X$ signifies the presence of a lens. The expectation value for an isothermal sphere
is $\mathrm{E}(X) = \theta_{\rm E} \sum_i a_i/(2\theta_i)$, where we used (3.18, page 51), and

$$ \mathrm{E}(X^2) = \sum_{i,j=1}^{N} a_i a_j\, \mathrm{E}(\epsilon_{{\rm t}i}\, \epsilon_{{\rm t}j})
   = [\mathrm{E}(X)]^2 + \frac{\sigma_\epsilon^2}{2} \sum_{i=1}^{N} a_i^2 . \tag{4.53} $$

We employed $\mathrm{E}(\epsilon_{{\rm t}i}\, \epsilon_{{\rm t}j}) = \gamma_{\rm t}(\theta_i)\,
\gamma_{\rm t}(\theta_j) + \delta_{ij}\, \sigma_\epsilon^2/2$ here, and the factor two is due to
the fact that the ellipticity dispersion only refers to one component of the ellipticity,
while $\sigma_\epsilon$ is defined as the dispersion of the two-component ellipticity. Therefore,
the signal-to-noise ratio for a detection of the lens is

$$ \frac{\rm S}{\rm N} = \frac{\theta_{\rm E}}{\sqrt{2}\,\sigma_\epsilon}\,
   \frac{\sum_i a_i\, \theta_i^{-1}}{\sqrt{\sum_i a_i^2}} . \tag{4.54} $$



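Differentiating (S/N) with respect to $a_j$, we find that (S/N) is maximised if the
$a_i$ are chosen $\propto \theta_i^{-1}$. Inserting this choice into (4.54) yields
${\rm S/N} = 2^{-1/2}\, \theta_{\rm E}\, \sigma_\epsilon^{-1} \left(\sum_i \theta_i^{-2}\right)^{1/2}$.
We now replace the sum by its ensemble average over the annulus,
$\langle\sum_i \theta_i^{-2}\rangle = N \langle\theta^{-2}\rangle = 2 n \pi \ln(\theta_{\rm out}/\theta_{\rm in})$,
where we used $N = \pi n (\theta_{\rm out}^2 - \theta_{\rm in}^2)$, with
the number density of galaxy images $n$. Substituting this result into (4.54), and
using the definition of the Einstein radius (3.17, page 51), the signal-to-noise ratio
becomes

$$ \frac{\rm S}{\rm N} = \frac{\theta_{\rm E}}{\sigma_\epsilon}\, \sqrt{\pi n}\,
   \sqrt{\ln(\theta_{\rm out}/\theta_{\rm in})}
   = 12.7 \left(\frac{n}{30\,{\rm arcmin}^{-2}}\right)^{1/2}
     \left(\frac{\sigma_\epsilon}{0.2}\right)^{-1}
     \left(\frac{\sigma_v}{600\,{\rm km\,s^{-1}}}\right)^{2}
     \left(\frac{\ln(\theta_{\rm out}/\theta_{\rm in})}{\ln 10}\right)^{1/2}
     \left\langle\frac{D_{\rm ds}}{D_{\rm s}}\right\rangle . \tag{4.55} $$
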
As expected, the signal-to-noise ratio is proportional to the square root of the number
density of galaxies and the inverse of the intrinsic ellipticity dispersion. Furthermore,
it is proportional to the square of the velocity dispersion $\sigma_v$. Assuming the
fiducial values given in eq. (4.55) and a typical value of
$\langle D_{\rm ds}/D_{\rm s}\rangle \sim 0.5$, lenses with velocity dispersion in excess of
$\sim 600\,{\rm km\,s^{-1}}$ can be detected with a signal-to-noise $\gtrsim 6$. This shows that
galaxy clusters will yield a significant weak lensing signal, and explains why clusters have
been the main target for weak-lensing research up to now. Individual galaxies with
$\sigma_v \sim 200\,{\rm km\,s^{-1}}$ cannot be detected with weak-lensing techniques. If one is
interested in the statistical properties of the mass distribution of galaxies, the lensing
effects of $N_{\rm gal}$ galaxies need to be statistically superposed, increasing (S/N) by a
factor of $\sqrt{N_{\rm gal}}$. Thus, it is necessary to superpose several hundred galaxies to
obtain a significant galaxy-galaxy lensing signal. We shall return to this topic in Sect. 7 on
page 156.
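The numbers quoted above can be reproduced with a short calculation. The sketch below assumes the standard Einstein radius of a singular isothermal sphere, $\theta_{\rm E} = 4\pi(\sigma_v/c)^2 D_{\rm ds}/D_{\rm s}$, which we take to be what (3.17) states; the function names and default values are illustrative.

```python
import math

C_KM_S = 299792.458                        # speed of light in km/s
ARCMIN_PER_RAD = 180.0 * 60.0 / math.pi

def einstein_radius_arcmin(sigma_v, dds_over_ds):
    """SIS Einstein radius 4 pi (sigma_v / c)^2 D_ds / D_s, in arcmin (assumed form)."""
    return 4.0 * math.pi * (sigma_v / C_KM_S) ** 2 * dds_over_ds * ARCMIN_PER_RAD

def sn_sis(sigma_v, n_arcmin2=30.0, sigma_eps=0.2, theta_ratio=10.0, dds_over_ds=1.0):
    """Detection signal-to-noise (theta_E / sigma_eps) sqrt(pi n ln(theta_out / theta_in))."""
    theta_e = einstein_radius_arcmin(sigma_v, dds_over_ds)
    return theta_e / sigma_eps * math.sqrt(math.pi * n_arcmin2 * math.log(theta_ratio))

def sn_stacked(sigma_v, n_gal, **kwargs):
    """Signal-to-noise after statistically superposing n_gal identical lenses."""
    return sn_sis(sigma_v, **kwargs) * math.sqrt(n_gal)
```

With the fiducial parameters and $D_{\rm ds}/D_{\rm s} = 0.5$, a cluster with $\sigma_v = 600\,{\rm km\,s^{-1}}$ reaches a signal-to-noise slightly above 6, while a single galaxy with $\sigma_v = 200\,{\rm km\,s^{-1}}$ stays below unity.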

We finally note that (4.55) also demonstrates that the detection of lenses will become
increasingly difficult with increasing lens redshift, as the last factor is a sensitive
function of $z_{\rm d}$. Therefore, most lenses so far investigated with weak-lensing
techniques have redshifts below 0.5. High-redshift clusters have only recently become
the target of detailed lensing studies.

4.6 Practical Consideration for Measuring Image Shapes

4.6.1 General Discussion

Real astronomical data used for weak lensing are supplied by CCD images. The 
steps from a CCD image to a set of galaxy images with measured ellipticities are 
highly non-trivial and cannot be explained in any detail in the frame of this review. 
Nevertheless, we want to mention some of the problems together with the solutions 
which were suggested and applied. 

The steps from CCD frames to image ellipticities can broadly be grouped into four 
categories: data reduction, image detection, shape determination, and corrections
for the point-spread function. The data-reduction process is more or less standard, 
involving de-biasing, flat-fielding, and removal of cosmic rays and bad pixels. For 
the latter purpose, it is essential to have several frames of the same field, slightly 
shifted in position. This also allows the flat field to be determined from the
images themselves (a nice description of these steps is given in Mould et al. 1994). 
To account for telescope and instrumental distortions, the individual frames have 
to be re-mapped before being combined into a final image. In order to do this, 
the geometric distortion has to be either known or stable. In the latter case, it can 
be determined by measuring the positions and shapes of stellar images (e.g., from 
a globular cluster). In Mould et al. (1994), the classical optical aberrations were 
determined and found to be in good agreement with the system's specifications 
obtained from ray-tracing analysis. 

With the individual frames stacked together in the combined image, the next step 
is to detect galaxies and to measure their shapes. This may appear simple, but is in 
fact not quite as straightforward, for several reasons. First, galaxy images are not
necessarily isolated on the image, but they can overlap, e.g. with other galaxies. Since
weak-lensing observations require a large number density of galaxy images, such
merged images are not rare. The question then arises whether a detected object is a
single galaxy, or a merged pair, and depending on the choice made, the measured
ellipticities will differ considerably. Second, the image is noisy because of the finite
number of photons per pixel and the noise intrinsic to the CCD electronics. Thus, 
a local enhancement of counts needs to be classified as a statistically significant 
source detection, and a conservative signal-to-noise threshold reduces the number 
of galaxy images. Third, galaxy images have to be distinguished from stars. This is 
not a severe problem, in particular if the field studied is far from the Galactic plane 
where the number density of stars is small. 

Several data-analysis software packages exist, such as FOCAS
(Jarvis & Tyson 1981) and SExtractor (Bertin & Arnouts 1996). They provide
routines, based on algorithms developed from experience and simulated data,
for the objective selection of objects, for measuring their centroids, multipole
moments, and magnitudes, and for classifying them as stars or extended objects.
Kaiser et al. (1995) developed their own object detection algorithm. It is based
on convolving the CCD image with two-dimensional Mexican hat-shaped filter
functions of variable width $\theta_s$. For each value of $\theta_s$, the maxima of the smoothed
intensity map are localised. Varying $\theta_s$, these maxima form curves in the
three-dimensional space spanned by $\vec\theta$ and $\theta_s$. Along each such curve, the significance of
a source detection is calculated, and the maximum of the significance is defined as
the location of an object with corresponding size $\theta_s$.
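The multi-scale detection idea can be illustrated with a deliberately simplified sketch. It assumes white pixel noise, periodic boundaries (FFT convolution), a single object per frame, and one particular normalisation of the Mexican-hat filter; the function names are hypothetical and this is not the actual implementation of Kaiser et al. (1995), which traces the maxima as curves in $(\vec\theta, \theta_s)$ rather than taking one global peak per scale.

```python
import numpy as np

def mexican_hat(shape, theta_s):
    """Zero-mean Mexican-hat filter of width theta_s (pixels), centred on pixel (0, 0)."""
    y = np.fft.fftfreq(shape[0]) * shape[0]   # wrapped pixel offsets 0, 1, ..., -1
    x = np.fft.fftfreq(shape[1]) * shape[1]
    u = (y[:, None] ** 2 + x[None, :] ** 2) / (2.0 * theta_s ** 2)
    return (1.0 - u) * np.exp(-u)

def detect_single_object(image, scales, noise_sigma=1.0):
    """Return (significance, (row, col), theta_s) of the most significant peak."""
    image_hat = np.fft.fft2(image)
    best = None
    for theta_s in scales:
        kernel = mexican_hat(image.shape, theta_s)
        smoothed = np.fft.ifft2(image_hat * np.fft.fft2(kernel)).real
        # rms of the filtered map for white pixel noise of rms noise_sigma
        noise_rms = noise_sigma * np.sqrt(np.sum(kernel ** 2))
        peak = tuple(int(i) for i in np.unravel_index(np.argmax(smoothed), smoothed.shape))
        nu = smoothed[peak] / noise_rms
        if best is None or nu > best[0]:
            best = (nu, peak, float(theta_s))
    return best
```

With this filter normalisation, a circular Gaussian blob of width $s$ is detected most significantly at a filter width close to $\sqrt{3}\,s$, so the returned $\theta_s$ tracks the object size, which is the property used in the text.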

Once an object is found, the quadrupole moments can in principle be obtained from
(4.2). In practice, however, this is not necessarily the most useful definition of the
moment tensor. The function $q_I(I)$ in (4.2) should be chosen such that it vanishes
for surface brightnesses close to and smaller than the sky brightness; otherwise,
one would sample too much noise. On the other hand, if $q_I$ is cut off at too bright
values of $I$, the area within which the quadrupole moments are measured becomes
too small, and the effects of seeing (see below) become overwhelming. Also, with
a too conservative cut-off, many galaxy images would be missed. Assume, for instance,
that $q_I(I) = I\,{\rm H}(I - I_{\rm th})$, where H is the Heaviside step function. One would
then choose $I_{\rm th}$ such that it is close to, but a few $\sigma_{\rm noise}$ above the sky background,
and the quadrupole moments would
then be measured inside the resulting limiting isophote. Since this isophote is close
to the sky background, its shape is affected by sky noise. This implies that the measured
quadrupole moments will depend highly non-linearly on the brightness on the
CCD; in particular, the effect of noise will enter the measured ellipticities in a
non-linear fashion. A more robust measurement of the quadrupole moments is obtained
by replacing the weight function $q_I[I(\vec\theta)]$ in (4.2) by $I\,W(\vec\theta)$, where $W(\vec\theta)$ explicitly
depends on $\vec\theta$. Kaiser et al. (1995) use a Gaussian of size $\theta_s$ as their weight function
$W$, i.e., the size of their $W$ is the scale on which the object was detected at highest
significance. It should be noted that the quadrupole moments obtained with a
weight function $W(\vec\theta)$ do not obey the transformation law (4.5), and therefore, the
expectation value of the ellipticity, ${\rm E}(\epsilon)$, will be different from the reduced shear $g$.
We return to this issue further below.

Another severe difficulty for the determination of the local shear is atmospheric
seeing. Due to atmospheric turbulence, a point-like source will be seen from the
ground as an extended image; the source is smeared out. Mathematically, this can
be described as a convolution. If $I(\vec\theta)$ is the surface brightness before passing the
Earth's atmosphere, the observed brightness distribution $I^{\rm (obs)}(\vec\theta)$ is

$$ I^{\rm (obs)}(\vec\theta) = \int {\rm d}^2\vartheta\; I(\vec\vartheta)\, P(\vec\theta - \vec\vartheta) , \tag{4.56} $$

where $P(\vec\theta)$ is the point-spread function (PSF) which describes the brightness
distribution of a point source on the CCD. $P(\vec\theta)$ is normalised to unity and centred
on $\vec 0$. The characteristic width of the PSF is called the size of the seeing disc. The
smaller it is, the less smeared the images are. A seeing well below 1 arc second is
required for weak-lensing observations, and there are only a handful of telescope
sites where such seeing conditions are regularly met. The reason for this strong
requirement on the data quality lies in the fact that weak-lensing studies require a
high number density of galaxy images, i.e., the observations have to be extended to
faint magnitudes. But the characteristic angular size of faint galaxies is below 1 arc
second. If the seeing is larger than that, the shape information is diluted or erased.

The PSF includes not only the effects of the Earth's atmosphere, but also pointing 
errors of the telescope (e.g., caused by wind shake). Therefore, the PSF will in gen- 
eral be slightly anisotropic. Thus, seeing has two important effects on the observed 
image ellipticities: Small elliptical images become rounder, and the anisotropy of 
the PSF introduces a systematic, spurious image ellipticity. The PSF can be deter- 
mined directly from the CCD once a number of isolated stellar images are identi- 
fied. The shape of the stars (which serve as point sources) reflects the PSF. Note 
that the PSF is not necessarily constant across the CCD. If the number density of 
stellar images is sufficiently large, one can empirically describe the PSF variation 
across the field by a low-order polynomial. An additional potential difficulty is the 
chromaticity of the PSF, i.e. the dependence of the PSF on the spectral energy distri- 
bution of the radiation. The PSF as measured from stellar images is not necessarily 
the same as the PSF which applies to galaxies, due to their different spectra. The 
difference of the PSFs is larger for broader filters. However, it is assumed that the 
PSF measured from stellar images adequately represents the PSF for galaxies. 

In the idealised case, in which the quadrupole moments are defined with the weight
function $q_I(I) = I$, the effect of the PSF on the observed image ellipticities can
easily be described. If $P_{ij}$ denotes the quadrupole tensor of the PSF, defined in
complete analogy to (4.2), then the observed quadrupole tensor $Q^{\rm (obs)}_{ij}$ is related to
the true one by $Q^{\rm (obs)}_{ij} = P_{ij} + Q_{ij}$ (see Valdes et al. 1983). The ellipticity $\chi$ then
transforms like

$$ \chi^{\rm (obs)} = \frac{\chi + T\,\chi^{\rm (PSF)}}{1 + T} , \tag{4.57} $$

where

$$ T = \frac{P_{11} + P_{22}}{Q_{11} + Q_{22}} ; \qquad \chi^{\rm (PSF)} = \frac{P_{11} - P_{22} + 2{\rm i}P_{12}}{P_{11} + P_{22}} . \tag{4.58} $$

Thus, $T$ expresses the ratio of the PSF size to the image size before convolution,
and $\chi^{\rm (PSF)}$ is the PSF ellipticity. It is evident from (4.57) that the smaller $T$, the
less $\chi^{\rm (obs)}$ deviates from $\chi$. In the limit of very large $T$, $\chi^{\rm (obs)}$ approaches $\chi^{\rm (PSF)}$.
In principle, the relation (4.57) could be inverted to obtain $\chi$ from $\chi^{\rm (obs)}$. However,
this inversion is unstable unless $T$ is sufficiently small, in the sense that noise
affecting the measurement of $\chi^{\rm (obs)}$ is amplified by the inversion process. Unfortunately,
these simple transformation laws only apply for the specific choice of the
weight function. For weighting schemes that can be applied to real data, the resulting
transformation becomes much more complicated.
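The instability of the inversion can be made concrete with a few lines of arithmetic. The sketch below applies (4.57) and its formal inverse, $\chi = (1+T)\,\chi^{\rm (obs)} - T\,\chi^{\rm (PSF)}$, to complex ellipticities; the numerical values are arbitrary illustrations, not taken from any data set.

```python
def smear(chi, chi_psf, T):
    """Observed ellipticity according to (4.57)."""
    return (chi + T * chi_psf) / (1.0 + T)

def unsmear(chi_obs, chi_psf, T):
    """Formal inversion of (4.57): chi = (1 + T) chi_obs - T chi_psf."""
    return (1.0 + T) * chi_obs - T * chi_psf

chi_true = 0.30 + 0.10j
chi_psf = 0.05 + 0.00j
noise = 0.02 + 0.00j   # illustrative measurement error on chi_obs

for T in (0.1, 1.0, 10.0):
    chi_obs = smear(chi_true, chi_psf, T)
    error = unsmear(chi_obs + noise, chi_psf, T) - chi_true
    print(f"T={T:5.1f}  chi_obs={chi_obs:.3f}  |error after inversion|={abs(error):.3f}")
```

An error on $\chi^{\rm (obs)}$ is multiplied by $(1+T)$ in the inversion, so for a PSF much larger than the galaxy ($T \gg 1$) the recovered ellipticity is dominated by noise.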

If a galaxy image features a bright compact core which emits a significant fraction
of the galaxy's light, this core will be smeared out by the PSF. In that case, $\chi^{\rm (obs)}$
may be dominated by the core and thus contain little information about the galaxy
ellipticity. This fact motivated Bonnet & Mellier (1995) to define the quadrupole
moments with a weight function $W(\vec\theta)$ which not only cuts off at large angular
separations, but which is also small near $\vec\theta = \vec 0$. Hence, their weight function is
significantly non-zero in an annulus with radius and width both being of the order
of the size of the PSF.

The difficulties mentioned above prohibit the determination of the local reduced
shear by straight averaging over the directly measured image ellipticities. This average
is affected by the use of an angle-dependent weight function $W$ in the practical
definition of the quadrupole moments, by the finite size of the PSF and its
anisotropy, and by noise. Bonnet & Mellier (1995) have performed detailed simulations
of CCD frames which resemble real observations as closely as possible, including
an anisotropic PSF. With these simulations, the efficiency of object detection,
the accuracy of their centre positions, and the relation between true and measured
image ellipticities can be investigated in detail, and so the relation between mean
ellipticity and (reduced) shear can approximately be calibrated. Wilson et al. (1996)
followed a very similar approach, except that the analysis of their simulated CCD
frames was performed with FOCAS. Assuming an isotropic PSF, the mean image
ellipticity is proportional to the reduced shear, $g \approx f\,\langle\epsilon\rangle$, with a correction
factor $f$ depending on the limiting galaxy magnitude, the photometric depth of the
image, and the size of the seeing disk. For a seeing of 0.8 arc seconds, Bonnet & Mellier
obtained a correction factor $f \sim 6$, whereas the correction factor in Wilson et al. for
the same seeing is $f \sim 1.5$. This large difference is not a discrepancy, but due to
the different definitions of the quadrupole tensor. Although the correction factor
is much larger for the Bonnet & Mellier method, they show that their measured
(and calibrated) shear estimate is more accurate than that obtained with FOCAS.
Kaiser et al. (1995) used CCD frames taken with WFPC2 on board HST which are
unaffected by atmospheric seeing, sheared them, and degraded the resulting images
by a PSF typical for ground-based images and by adding noise. In this way,
they calibrated their shear measurement and tested their removal of an anisotropic
contribution of the PSF.

However, calibrations relying on simulated images are not fully satisfactory 
since the results will depend on the assumptions underlying the simulations. 
Kaiser et al. (1995) and Luppino & Kaiser (1997) presented a perturbative ap- 
proach for correcting the observed image ellipticities for PSF effects, with ad- 
ditional modifications made by Hoekstra et al. (1998) and Hudson et al. (1998). 
Since the measurement of ellipticities lies at the heart of weak lensing studies, we 
shall present this approach in the next subsection, despite its being highly technical. 

4.6.2 The KSB Method



Closely following the work by Kaiser et al. (1995), this subsection provides a re- 
lation between the observed image ellipticity and a source ellipticity known to be 
isotropically distributed. The relation corrects for PSF smearing and its anisotropy,
and it also takes into account that the transformation (4.5) no longer applies if the
weight factor explicitly depends on $\vec\theta$.

We consider the quadrupole tensor

$$ Q_{ij} = \int {\rm d}^2\theta\, (\theta_i - \bar\theta_i)(\theta_j - \bar\theta_j)\, I(\vec\theta)\, W\!\left(|\vec\theta - \bar{\vec\theta}|^2/\sigma^2\right) , \tag{4.59} $$

where $W$ contains a typical scale $\sigma$, and $\bar{\vec\theta}$ is defined as in (4.1), but with the new
weight function. Note that, in contrast to the definition (4.2), this tensor is no longer
normalised by the flux, but this does not affect the definition (4.4) of the complex
ellipticity.

The relation between the observed surface brightness $I^{\rm obs}(\vec\theta)$ and the true surface
brightness $I$ is given by (4.56). We assume in the following that $P$ is nearly
isotropic, so that the anisotropic part of $P$ is small. Then, we define the isotropic
part $P^{\rm iso}$ of $P$ as the azimuthal average over $P$, and decompose $P$ into an isotropic
and an anisotropic part as

$$ P(\vec\vartheta) = \int {\rm d}^2\varphi\; q(\vec\varphi)\, P^{\rm iso}(\vec\vartheta - \vec\varphi) , \tag{4.60} $$

which defines $q$ uniquely. In general, $q(\vec\varphi)$ will be an almost singular function,
but we shall show later that it has well-behaved moments. Both $P^{\rm iso}$ and $q$ are
normalised to unity and have vanishing first moments. With $P^{\rm iso}$, we define the
brightness profiles

$$ I^{\rm iso}(\vec\theta) = \int {\rm d}^2\varphi\; I(\vec\varphi)\, P^{\rm iso}(\vec\theta - \vec\varphi) , \qquad
   I^{0}(\vec\theta) = \int {\rm d}^2\varphi\; I^{\rm s}(\vec\varphi)\, P^{\rm iso}(\vec\theta - \vec\varphi) . \tag{4.61} $$

The first of these would be observed if the true image was smeared only with an
isotropic PSF, and the second is the unlensed source smeared with $P^{\rm iso}$. Both of
these brightness profiles are unobservable, but convenient for the following discussion.
For each of them, we can define a quadrupole tensor as in (4.59). From each
quadrupole tensor, we define the complex ellipticity $\chi = \chi_1 + {\rm i}\chi_2$, in analogy to
(4.4).

If we define the centres of images including a spatial weight function, the property
that the centre of the image is mapped onto the centre of the source through the
lens equation is no longer strictly true. However, the deviations are expected to be
very small in general and will be neglected in the following. Hence, we choose
coordinates such that $\bar{\vec\theta} = \vec 0$, and approximate the other centres to be at the origin
as well.

According to our fundamental assumption that the intrinsic ellipticities are randomly
oriented, this property is shared by the ellipticities $\chi^0$ defined in terms of $I^0$
[see (4.61)], because it is unaffected by an isotropic PSF. Therefore, we can replace
(4.13) by ${\rm E}(\chi^0) = 0$ in the determination of $g$. The task is then to relate the observed
image ellipticity $\chi^{\rm obs}$ to $\chi^0$. We break it into several steps.



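From $\chi^{\rm iso}$ to $\chi^{\rm obs}$. We first look into the effect of an anisotropic PSF on the
observed ellipticity. According to (4.60) and (4.61),

$$ I^{\rm obs}(\vec\theta) = \int {\rm d}^2\varphi\; q(\vec\theta - \vec\varphi)\, I^{\rm iso}(\vec\varphi) . \tag{4.62} $$

Let $f(\vec\theta)$ be an arbitrary function, and consider

$$ \begin{aligned}
\int {\rm d}^2\theta\, f(\vec\theta)\, I^{\rm obs}(\vec\theta)
 &= \int {\rm d}^2\varphi\, I^{\rm iso}(\vec\varphi) \int {\rm d}^2\vartheta\, f(\vec\varphi + \vec\vartheta)\, q(\vec\vartheta) \\
 &= \int {\rm d}^2\varphi\, I^{\rm iso}(\vec\varphi)\, f(\vec\varphi)
  + \frac{1}{2}\, q_{kl} \int {\rm d}^2\varphi\, I^{\rm iso}(\vec\varphi)\, \frac{\partial^2 f}{\partial\varphi_k\,\partial\varphi_l} + O(q^2) .
\end{aligned} \tag{4.63} $$

We used the fact that $q$ is normalised and has zero mean, and defined

$$ q_{ij} = \int {\rm d}^2\varphi\; q(\vec\varphi)\, \varphi_i \varphi_j , \qquad q_1 \equiv q_{11} - q_{22} , \qquad q_2 \equiv 2 q_{12} . \tag{4.64} $$

The tensor $q_{ij}$ is trace-less, $q_{11} = -q_{22}$, following from (4.60). We consider in the
following only terms up to linear order in $q$. To that order, we can replace $I^{\rm iso}$ by
$I^{\rm obs}$ in the final term in (4.63), since the difference would yield a term $\propto O(q^2)$.
Hence,

$$ \int {\rm d}^2\varphi\, I^{\rm iso}(\vec\varphi)\, f(\vec\varphi) \approx \int {\rm d}^2\theta\, f(\vec\theta)\, I^{\rm obs}(\vec\theta)
 - \frac{1}{2}\, q_{kl} \int {\rm d}^2\varphi\, I^{\rm obs}(\vec\varphi)\, \frac{\partial^2 f}{\partial\varphi_k\,\partial\varphi_l} . \tag{4.65} $$

Setting $\sigma_{\rm iso} = \sigma_{\rm obs} \equiv \sigma$ in the definition of the quadrupole tensors $Q^{\rm iso}$ and $Q^{\rm obs}$,
and choosing $f(\vec\theta) = \theta_i \theta_j\, W(|\vec\theta|^2/\sigma^2)$, yields

$$ Q^{\rm iso}_{ij} = Q^{\rm obs}_{ij} - \frac{1}{2}\, Z_{ijkl}\, q_{kl} , \tag{4.66} $$

where the Einstein summation convention was adopted, and where

$$ Z_{ijkl} = \int {\rm d}^2\varphi\; I^{\rm obs}(\vec\varphi)\, \frac{\partial^2}{\partial\varphi_k\,\partial\varphi_l} \left[ \varphi_i \varphi_j\, W\!\left(\frac{|\vec\varphi|^2}{\sigma^2}\right) \right] . \tag{4.67} $$

This then yields

$$ \begin{aligned}
{\rm tr}(Q^{\rm iso}) &= {\rm tr}(Q^{\rm obs}) - x_\alpha q_\alpha , \\
(Q^{\rm iso}_{11} - Q^{\rm iso}_{22}) &= (Q^{\rm obs}_{11} - Q^{\rm obs}_{22}) - X_{1\alpha} q_\alpha , \quad\hbox{and} \\
2 Q^{\rm iso}_{12} &= 2 Q^{\rm obs}_{12} - X_{2\alpha} q_\alpha ,
\end{aligned} \tag{4.68} $$

where the sums run over $\alpha = 1, 2$ (Greek rather than Latin indices; see the footnote).
Up to linear order in $q_\alpha$,

$$ \chi^{\rm iso}_\alpha = \chi^{\rm obs}_\alpha - P^{\rm sm}_{\alpha\beta}\, q_\beta , \tag{4.69} $$

with the definitions

$$ \begin{aligned}
P^{\rm sm}_{\alpha\beta} &= \frac{1}{{\rm tr}\,Q^{\rm obs}} \left( X_{\alpha\beta} - \chi^{\rm obs}_\alpha\, x_\beta \right) , \\
X_{\alpha\beta} &= \int {\rm d}^2\varphi\; I^{\rm obs}(\vec\varphi) \left[ \left( W + 2|\vec\varphi|^2\, \frac{W'}{\sigma^2} \right) \delta_{\alpha\beta} + \eta_\alpha(\vec\varphi)\, \eta_\beta(\vec\varphi)\, \frac{W''}{\sigma^4} \right] , \\
x_\alpha &= \int {\rm d}^2\varphi\; I^{\rm obs}(\vec\varphi)\, \eta_\alpha(\vec\varphi) \left( \frac{2W'}{\sigma^2} + |\vec\varphi|^2\, \frac{W''}{\sigma^4} \right) ,
\end{aligned} \tag{4.70} $$

and

$$ \eta_1(\vec\theta) = \theta_1^2 - \theta_2^2 ; \qquad \eta_2(\vec\theta) = 2\theta_1\theta_2 . \tag{4.71} $$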



$P^{\rm sm}_{\alpha\beta}$ was dubbed smear polarisability in Kaiser et al. (1995). It describes the (linear)
response of the ellipticity to a PSF anisotropy. Note that $P^{\rm sm}_{\alpha\beta}$ depends on the
observed brightness profile. In particular, its size decreases for larger images, as
expected: The ellipticities of larger images are less affected by a PSF anisotropy
than those of smaller images.
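A minimal numerical sketch of the smear polarisability, evaluated on a pixel grid, may help to fix the notation of (4.69) to (4.71). It assumes a Gaussian weight $W(u) = \exp(-u/2)$ with $u = |\vec\theta|^2/\sigma^2$, takes the array centre as the image centre, and replaces the integrals by pixel sums; the function name is hypothetical and this is not the imcat implementation.

```python
import numpy as np

def smear_polarisability(image, sigma):
    """Return (chi, P_sm) following (4.69)-(4.71) for a Gaussian weight of scale sigma (pixels)."""
    n1, n2 = image.shape
    # Coordinates relative to the array centre, assumed here to be the image centre.
    t1, t2 = np.meshgrid(np.arange(n1) - (n1 - 1) / 2.0,
                         np.arange(n2) - (n2 - 1) / 2.0, indexing="ij")
    r2 = t1 ** 2 + t2 ** 2
    w = np.exp(-0.5 * r2 / sigma ** 2)    # W(u) = exp(-u/2)
    dw = -0.5 * w                         # W'(u)
    d2w = 0.25 * w                        # W''(u)
    eta = np.array([t1 ** 2 - t2 ** 2, 2.0 * t1 * t2])    # eq. (4.71)

    trace_q = np.sum(image * r2 * w)                       # Q_11 + Q_22
    chi = np.array([np.sum(image * e * w) for e in eta]) / trace_q

    iso = np.sum(image * (w + 2.0 * r2 * dw / sigma ** 2))
    big_x = iso * np.eye(2) + np.array(
        [[np.sum(image * eta[a] * eta[b] * d2w) for b in range(2)] for a in range(2)]
    ) / sigma ** 4
    small_x = np.array(
        [np.sum(image * e * (2.0 * dw / sigma ** 2 + r2 * d2w / sigma ** 4)) for e in eta]
    )
    p_sm = (big_x - np.outer(chi, small_x)) / trace_q
    return chi, p_sm
```

For a circular Gaussian image of width $s$, these definitions give $P^{\rm sm}_{\alpha\beta} = \delta_{\alpha\beta}\,\sigma^2/[2s^2(s^2+\sigma^2)]$, which indeed decreases for larger images, in line with the remark above.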



The determination of $q_\alpha$. Equation (4.69) provides a relation between the
ellipticities of an observed image and a hypothetical image smeared by an isotropic
PSF. In order to apply this relation, the anisotropy term $q_\alpha$ needs to be known. It
can be determined from the shape of stellar images.

Since stars are point-like and unaffected by lensing, their isotropically smeared
images have zero ellipticity, $\chi^{*,{\rm iso}} = 0$. Hence, from (4.69),

$$ q_\alpha = \left( P^{*,{\rm sm}} \right)^{-1}_{\alpha\beta}\, \chi^{*,{\rm obs}}_\beta . \tag{4.72} $$



Footnote: We use Greek instead of Latin indices $\alpha, \beta = 1, 2$ to denote that they are not tensor
indices. In particular, the components of $\chi$ do not transform like a vector, but like the
trace-less part of a symmetric tensor.

In general, the PSF varies with the position of an image. If this variation is sufficiently
smooth, $q$ can be measured for a set of stars, and approximated by a low-order
polynomial across the data field. As pointed out by Hoekstra et al. (1998), the
scale size $\sigma$ in the measurement of $q$ is best chosen to be the same as that of the
galaxy image under consideration. Hence, for each value of $\sigma$, such a polynomial
fit is constructed. This approach works well and provides an estimate of $q$ at the
position of all galaxies, which can then be used in the transformation (4.69).
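The two steps just described, solving (4.72) for each star and interpolating $q$ across the field, can be sketched as follows. The polynomial order, the unweighted least-squares fit, and all function names are assumptions made for illustration; the text only specifies "a low-order polynomial".

```python
import numpy as np

def anisotropy_from_star(chi_star_obs, p_sm_star):
    """Solve chi*_obs = P*_sm q for q, i.e. eq. (4.72), for a single star."""
    return np.linalg.solve(p_sm_star, chi_star_obs)

def design_matrix(x, y, order=2):
    """Columns x^i y^j with i + j <= order."""
    return np.column_stack([x ** i * y ** j
                            for i in range(order + 1)
                            for j in range(order + 1 - i)])

def fit_anisotropy(x, y, q, order=2):
    """Least-squares polynomial coefficients; q has shape (n_stars, 2)."""
    coeffs, *_ = np.linalg.lstsq(design_matrix(x, y, order), q, rcond=None)
    return coeffs

def evaluate_anisotropy(coeffs, x, y, order=2):
    """Interpolated (q_1, q_2) at the positions (x, y), e.g. those of galaxies."""
    return design_matrix(x, y, order) @ coeffs
```

Following Hoekstra et al. (1998), one such fit would be built for each weight scale $\sigma$ used for the galaxies, with `chi_star_obs` and `p_sm_star` measured at that same scale.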



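From $\chi^0$ to $\chi^{\rm iso}$. We now relate $\chi^{\rm iso}$ to the ellipticity $\chi^0$ of a hypothetical image
obtained from isotropic smearing of the source. To do so, we use (4.61) and (3.10)
in the form $I(\vec\theta) = I^{\rm s}(\mathcal{A}\vec\theta)$, and consider

$$ \begin{aligned}
I^{\rm iso}(\vec\theta) &= \int {\rm d}^2\varphi\; I^{\rm s}(\mathcal{A}\vec\varphi)\, P^{\rm iso}(\vec\theta - \vec\varphi) \\
 &= \frac{1}{\det\mathcal{A}} \int {\rm d}^2\zeta\; I^{\rm s}(\vec\zeta)\, P^{\rm iso}(\vec\theta - \mathcal{A}^{-1}\vec\zeta) \equiv \hat I(\mathcal{A}\vec\theta) .
\end{aligned} \tag{4.73} $$

The second step is merely a transformation of the integration variable, and in the
final step we defined the brightness profile

$$ \hat I(\vec\theta) = \int {\rm d}^2\varphi\; I^{\rm s}(\vec\varphi)\, \hat P(\vec\theta - \vec\varphi) \quad\hbox{with}\quad \hat P(\vec\theta) \equiv \frac{1}{\det\mathcal{A}}\, P^{\rm iso}(\mathcal{A}^{-1}\vec\theta) . \tag{4.74} $$

The function $\hat P$ is normalised and has zero mean. It can be interpreted as a PSF
relating $\hat I$ to $I^{\rm s}$. The presence of shear renders $\hat P$ anisotropic.

We next seek to find a relation between the ellipticities of $I^{\rm iso}$ and $\hat I$:

$$ \begin{aligned}
\hat Q_{ij} &= \int {\rm d}^2\beta\; \beta_i \beta_j\, \hat I(\vec\beta)\, W\!\left(\frac{|\vec\beta|^2}{\hat\sigma^2}\right) \\
 &= \det\mathcal{A}\; \mathcal{A}_{ik}\mathcal{A}_{jl} \int {\rm d}^2\theta\; \theta_k \theta_l\, I^{\rm iso}(\vec\theta)\, W\!\left(\frac{|\vec\theta|^2 - \delta_\alpha \eta_\alpha(\vec\theta)}{\sigma^2}\right) .
\end{aligned} \tag{4.75} $$

The relation between the two filter scales is given by $\hat\sigma^2 = (1 - \kappa)^2 (1 + |g|^2)\,\sigma^2$, and
$\delta$ is the distortion (4.15). For small $\delta$, we can employ a first-order Taylor expansion
of the weight function $W$ in the previous equation. This results in the following
relation between $\hat\chi$ and $\chi^{\rm iso}$:

$$ \chi^{\rm iso}_\alpha - \hat\chi_\alpha = C_{\alpha\beta}\, g_\beta , \tag{4.76} $$

where

$$ \begin{aligned}
C_{\alpha\beta} &= 2\delta_{\alpha\beta} - 2\chi^{\rm iso}_\alpha \chi^{\rm iso}_\beta + \frac{2}{{\rm tr}(Q^{\rm iso})}\, \chi^{\rm iso}_\alpha L_\beta - \frac{2}{{\rm tr}(Q^{\rm iso})}\, B_{\alpha\beta} , \\
B_{\alpha\beta} &= -\int {\rm d}^2\theta\; I^{\rm iso}(\vec\theta)\, W'\!\left(\frac{|\vec\theta|^2}{\sigma^2}\right) \frac{1}{\sigma^2}\, \eta_\alpha(\vec\theta)\, \eta_\beta(\vec\theta) , \\
L_\alpha &= -\int {\rm d}^2\theta\; |\vec\theta|^2\, I^{\rm iso}(\vec\theta)\, W'\!\left(\frac{|\vec\theta|^2}{\sigma^2}\right) \frac{1}{\sigma^2}\, \eta_\alpha(\vec\theta) .
\end{aligned} \tag{4.77} $$

$C$ is the shear polarisability of Kaiser et al. (1995). Whereas $C$ is defined in terms
of $I^{\rm iso}$, owing to the assumed smallness of $q$, the difference of $C$ calculated with $I^{\rm iso}$
and $I^{\rm obs}$ would cause a second-order change in (4.76) and is neglected, so that we
can calculate $C$ directly from the observed brightness profile.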
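As for the smear polarisability, the shear polarisability can be sketched on a pixel grid. The expression coded below is the first-order expansion of (4.75) to (4.76), evaluated with the observed profile; a Gaussian weight $W(u) = \exp(-u/2)$, the array centre as image centre, and pixel sums instead of integrals are assumptions of this illustration, and the function name is hypothetical.

```python
import numpy as np

def shear_polarisability(image, sigma):
    """Return (chi, C) with C the first-order response of chi to a reduced shear g."""
    n1, n2 = image.shape
    # Coordinates relative to the array centre, assumed here to be the image centre.
    t1, t2 = np.meshgrid(np.arange(n1) - (n1 - 1) / 2.0,
                         np.arange(n2) - (n2 - 1) / 2.0, indexing="ij")
    r2 = t1 ** 2 + t2 ** 2
    w = np.exp(-0.5 * r2 / sigma ** 2)    # W(u) = exp(-u/2)
    dw = -0.5 * w                         # W'(u)
    eta = np.array([t1 ** 2 - t2 ** 2, 2.0 * t1 * t2])

    trace_q = np.sum(image * r2 * w)
    chi = np.array([np.sum(image * e * w) for e in eta]) / trace_q

    big_b = -np.array(
        [[np.sum(image * dw * eta[a] * eta[b]) for b in range(2)] for a in range(2)]
    ) / sigma ** 2
    big_l = -np.array([np.sum(r2 * image * dw * e) for e in eta]) / sigma ** 2

    c = (2.0 * np.eye(2) - 2.0 * np.outer(chi, chi)
         + 2.0 * np.outer(chi, big_l) / trace_q - 2.0 * big_b / trace_q)
    return chi, c
```

Two limits serve as sanity checks: for a very broad weight ($\sigma \to \infty$, so $W' \to 0$) a round image has $C_{\alpha\beta} \to 2\delta_{\alpha\beta}$, the unweighted response, and for a circular Gaussian image of width $s$ one finds $C_{\alpha\beta} = 2\sigma^2/(s^2+\sigma^2)\,\delta_{\alpha\beta}$.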

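In analogy to (4.60), we can decompose $\hat P$ into an isotropic and an anisotropic part,
the latter one being small due to the assumed smallness of the shear,

$$ \hat P(\vec\theta) = \int {\rm d}^2\varphi\; P^{\rm iso}(\vec\varphi)\, \hat q(\vec\theta - \vec\varphi) . \tag{4.78} $$

Defining the brightness profile which would be obtained from smearing the source
with the isotropic PSF $P^{\rm iso}$, $I^0(\vec\theta) = \int {\rm d}^2\varphi\, I^{\rm s}(\vec\varphi)\, P^{\rm iso}(\vec\theta - \vec\varphi)$, one finds

$$ \hat I(\vec\theta) = \int {\rm d}^2\varphi\; I^0(\vec\varphi)\, \hat q(\vec\theta - \vec\varphi) . \tag{4.79} $$

Thus, the relation between $\hat I$ and $I^0$ is the same as that between $I^{\rm obs}$ and $I^{\rm iso}$, and we
can write

$$ \chi^0_\alpha = \hat\chi_\alpha - P^{\rm sm}_{\alpha\beta}\, \hat q_\beta . \tag{4.80} $$

Note that $P^{\rm sm}$ should in principle be calculated by using $\hat I$ instead of $I^{\rm obs}$ in (4.69).
However, due to the assumed smallness of $g$ and $q$, the differences between $I^{\rm obs}$,
$I^{\rm iso}$, and $\hat I$ are small, namely of first order in $g$ and $q$. Since $\hat q$ is of order $g$ [as is
obvious from its definition, and will be shown explicitly in (4.82)], this difference
in the calculation of $P^{\rm sm}$ would be of second order in (4.80) and is neglected here.

Eliminating $\hat\chi$ from (4.76) and (4.80), we obtain

$$ \chi^{\rm iso}_\alpha = \chi^0_\alpha + C_{\alpha\beta}\, g_\beta + P^{\rm sm}_{\alpha\beta}\, \hat q_\beta . \tag{4.81} $$

Now, for stellar objects, both $\chi^0$ and $\chi^{\rm iso}$ vanish, which implies a relation between
$\hat q$ and $g$,

$$ \hat q_\alpha = -\left( P^{{\rm sm}*} \right)^{-1}_{\alpha\beta} C^*_{\beta\gamma}\, g_\gamma , \tag{4.82} $$

where the asterisk indicates that $P^{\rm sm}$ and $C$ are to be calculated from stellar images.
Whereas the result should in principle not depend on the choice of the scale length
in the weight function, it does so in practice. As argued in Hoekstra et al. (1998),
one should use the same scale length in $P^{{\rm sm}*}$ and $C^*$ as for the galaxy object for
which the ellipticities are measured. Defining now

$$ P^{g}_{\alpha\beta} = C_{\alpha\beta} - P^{\rm sm}_{\alpha\gamma} \left( P^{{\rm sm}*} \right)^{-1}_{\gamma\delta} C^*_{\delta\beta} , \tag{4.83} $$

and combining (4.69) and (4.81), we finally obtain

$$ \chi^0_\alpha = \chi^{\rm obs}_\alpha - P^{\rm sm}_{\alpha\beta}\, q_\beta - P^{g}_{\alpha\beta}\, g_\beta . \tag{4.84} $$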

This equation relates the observed ellipticity to that of the source smeared by an
isotropic PSF, using the PSF anisotropy and the reduced shear $g$. Since the expectation
value of $\chi^0$ is zero, (4.84) yields an estimate of $g$. The two tensors $P^{\rm sm}$
and $P^{g}$ can be calculated from the brightness profile of the images. Whereas the
treatment has been confined to first order in the PSF anisotropy and the shear,
the simulations in Kaiser et al. (1995) and Hoekstra et al. (1998) show that the resulting
equations can be applied even for moderately large shear. A numerical
implementation of these relations, the imcat software, is provided by N. Kaiser
(see http://www.ifa.hawaii.edu/~kaiser). We also note that modifications
of this scheme were recently suggested (Rhodes et al. 1999, Kaiser 1999), as well
as a completely different approach to shear measurements (Kuijken 1999).
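To show how (4.83) and (4.84) combine into a shear estimate, the following sketch averages (4.84) over a set of galaxies and uses ${\rm E}(\chi^0) = 0$. Averaging $P^g$ over the sample before inverting it is only one possible choice, made here for simplicity; the function names and array layouts are hypothetical and unrelated to imcat.

```python
import numpy as np

def shear_tensor(c_gal, p_sm_gal, c_star, p_sm_star):
    """P^g of eq. (4.83) for one galaxy, given stellar tensors at the same weight scale."""
    return c_gal - p_sm_gal @ np.linalg.solve(p_sm_star, c_star)

def estimate_shear(chi_obs, p_sm, p_g, q):
    """Solve the sample average of (4.84) for g, assuming <chi^0> = 0.

    chi_obs, q: arrays of shape (n, 2); p_sm, p_g: arrays of shape (n, 2, 2).
    """
    corrected = chi_obs - np.einsum("nab,nb->na", p_sm, q)   # remove PSF anisotropy
    return np.linalg.solve(p_g.mean(axis=0), corrected.mean(axis=0))
```

In practice the average would be taken over galaxies in a local window, so that the result is an estimate of the local reduced shear.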

5 Weak Lensing by Galaxy Clusters

5.1 Introduction 



So far, weak gravitational lensing has chiefly been applied to determine the mass 
distribution of medium-redshift galaxy clusters. The main reason for this can be 
seen from eq. (4.55): Clusters are massive enough to be individually detected by 
weak lensing. More traditional methods to infer the matter distribution in clusters 
are (a) dynamical methods, in which the observed line-of-sight velocity distribu- 
tion of cluster galaxies is used in conjunction with the virial theorem, and (b) the 
investigation of the diffuse X-ray emission from the hot ($\sim 10^7\,$K) intra-cluster gas
residing in the cluster potential well (see, e.g., Sarazin 1986). 

Both of these methods are based on rather strong assumptions. For the dynamical 
method to be reliable, the cluster must be in or near virial equilibrium, which is not 
guaranteed because the typical dynamical time scale of a cluster is not much shorter
than the Hubble time $H_0^{-1}$, and the substructure abundantly observed in clusters
indicates that an appreciable fraction of them is still in the process of formation.
Projection effects and the anisotropy of galaxy orbits in clusters further affect the
mass determination by dynamical methods. On the other hand, X-ray analyses rely
on the assumption that the intra-cluster gas is in hydrostatic equilibrium. Owing 
to the finite spatial and energy resolution of existing X-ray instruments, one often 
has to conjecture the temperature profile of the gas. Here, too, the influence of 
projection effects is difficult to assess. 

Whereas these traditional methods have provided invaluable information on the 
physics of galaxy clusters, and will continue to do so, gravitational lensing offers a 
welcome alternative approach, for it determines the projected mass distribution of 
a cluster independent of the physical state and nature of the matter. In particular, 
it can be used to calibrate the other two methods, especially for clusters showing 
evidence of recent merger events, for which the equilibrium assumptions are likely 
to fail. Finally, as we shall show below, the determination of cluster mass profiles 
by lensing is theoretically simple, and recent results show that the observational 
challenges can also be met with modern telescopes and instruments.

Both shear and magnification effects have been observed in a number of galaxy 
clusters. In this chapter, we discuss the methods by which the projected mass dis- 
tribution in clusters can be determined from the observed lensing effects, and show 
some results of mass reconstructions, together with a brief discussion of their astro- 
physical relevance. Sect. 5.2 presents the principles of cluster mass reconstruction 
from estimates of the (reduced) shear obtained from image ellipticities. In contrast 
to the two-dimensional mass maps generated by these reconstructions, the aperture 
mass methods discussed in Sect. 5.3 determine a single number to characterise the 
bulk properties of the cluster mass. Observational results are presented in Sect. 5.4.
We outline further developments in the final section, including the combined analysis
of shear and magnification effects, maximum-likelihood methods for the mass
reconstruction, and a method for measuring local lens parameters from the
extragalactic background noise.



5.2 Cluster Mass Reconstruction from Image Distortions

We discussed in detail in Sect. 4 how the distortion of image shapes can be used 
to determine the local tidal gravitational field of a cluster. In this section, we de- 
scribe how this information can be used to construct two-dimensional mass maps 
of clusters. 

Shortly after the discovery of giant luminous arcs (Soucail et al. 1987a; 
Lynds & Petrosian 1989), Fort et al. (1988) detected a number of distorted galaxy 
images in the cluster A 370. They also interpreted these arclets as distorted back- 
ground galaxy images, but on a weaker level than the giant luminous arc in the 
same cluster. The redshift determination of one arclet by Mellier et al. (1991) provided
early support for this interpretation. Tyson et al. (1990) discovered a coherent
distortion of faint galaxy images in the clusters A 1689 and Cl 1409+52, and
constrained their (dark) mass profiles from the observed 'shear'. Kochanek (1990)
and Miralda-Escudé (1991b) studied in detail how parameterised mass models for
clusters can be constrained from such distortion measurements.

The field began to flourish after Kaiser & Squires (1993) found that the distortions 
can be used for parameter-free reconstructions of cluster surface mass densities. 
Their method, and several variants of it, will be described in this section. It has 
so far been applied to about 15 clusters, and this number is currently limited by 
the number of available dark nights with good observing conditions at the large 
telescopes which are required for observations of weak lensing. 



5.2.1 Linear Inversion of Shear Maps 

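Equation (3.15) shows that the shear $\gamma$ is a convolution of the surface mass
density $\kappa$ with the kernel $\mathcal{D}$. This relation is easily inverted in Fourier space to
return the surface mass density in terms of a linear functional of the shear. Hence,
if the shear can be observed from image distortions, the surface mass density can
directly be obtained. Let the Fourier transform of $\kappa(\vec\theta)$ be

$$ \hat\kappa(\vec l) = \int_{\mathbb{R}^2} {\rm d}^2\theta\; \kappa(\vec\theta)\, \exp({\rm i}\,\vec\theta\cdot\vec l) . \tag{5.1} $$

The Fourier transform of the complex kernel $\mathcal{D}$ defined in (3.15) is

$$ \hat{\mathcal{D}}(\vec l) = \pi\, \frac{l_1^2 - l_2^2 + 2{\rm i}\, l_1 l_2}{|\vec l|^2} . \tag{5.2} $$

Using the convolution theorem, eq. (3.15) can be written
$\hat\gamma(\vec l) = \pi^{-1} \hat{\mathcal{D}}(\vec l)\, \hat\kappa(\vec l)$ for $\vec l \ne \vec 0$. Multiplying both sides of this equation with
$\hat{\mathcal{D}}^*$ and using $\hat{\mathcal{D}}\hat{\mathcal{D}}^* = \pi^2$ gives

$$ \hat\kappa(\vec l) = \pi^{-1}\, \hat\gamma(\vec l)\, \hat{\mathcal{D}}^*(\vec l) \quad\hbox{for}\quad \vec l \ne \vec 0 , \tag{5.3} $$

and the convolution theorem leads to the final result

$$ \begin{aligned}
\kappa(\vec\theta) - \kappa_0 &= \frac{1}{\pi} \int_{\mathbb{R}^2} {\rm d}^2\theta'\; \mathcal{D}^*(\vec\theta - \vec\theta')\, \gamma(\vec\theta') \\
 &= \frac{1}{\pi} \int_{\mathbb{R}^2} {\rm d}^2\theta'\; \Re\left[ \mathcal{D}^*(\vec\theta - \vec\theta')\, \gamma(\vec\theta') \right]
\end{aligned} \tag{5.4} $$

(Kaiser & Squires 1993). The constant $\kappa_0$ in (5.4) appears because a constant surface
mass density does not cause any shear and is thus unconstrained by $\gamma$. The two
expressions in (5.4) are equivalent because the imaginary part of the convolution
vanishes, $\Im(\mathcal{D}^* * \gamma) \equiv 0$, as can be shown from the
Fourier transforms of equations (3.12). In applications, the second form
of (5.4) should be used to ensure that $\kappa$ is real. Relation (5.4) can either be applied
to a case where all the sources are at the same redshift, in which case $\kappa$ and $\gamma$ are
defined as in eqs. (3.7) and (3.12), or where the sources are distributed in redshift,
in which case $\kappa$ and $\gamma$ are interpreted as convergence and shear for a hypothetical source
at infinite redshift, as discussed in Sect. 4.3.2.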
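The Fourier-space form (5.3) translates directly into a few lines of array code. The sketch below assumes a shear map sampled on a regular grid with periodic boundary conditions (which real, finite fields do not satisfy) and sets the unconstrained $\vec l = \vec 0$ mode to zero, so that $\kappa$ is returned with zero mean; the function names are illustrative.

```python
import numpy as np

def kernel_hat(shape):
    """D_hat(l) / pi from eq. (5.2) on the FFT grid; the l = 0 mode is set to zero."""
    l1 = np.fft.fftfreq(shape[0])[:, None]
    l2 = np.fft.fftfreq(shape[1])[None, :]
    l_sq = l1 ** 2 + l2 ** 2
    l_sq[0, 0] = 1.0                       # avoid 0/0; this mode is zeroed below
    d_hat = (l1 ** 2 - l2 ** 2 + 2j * l1 * l2) / l_sq
    d_hat[0, 0] = 0.0
    return d_hat

def shear_from_kappa(kappa):
    """Complex shear map gamma = gamma_1 + i gamma_2 produced by a convergence map."""
    return np.fft.ifft2(kernel_hat(kappa.shape) * np.fft.fft2(kappa))

def kappa_from_shear(gamma):
    """Eq. (5.3): convergence up to an additive constant (zero mean on the grid)."""
    return np.fft.ifft2(np.conj(kernel_hat(gamma.shape)) * np.fft.fft2(gamma)).real
```

Because the kernel is quadratic in $\vec l$, the opposite sign convention of the discrete Fourier transform relative to (5.1) does not matter here. Taking the real part corresponds to using the second form of (5.4).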

In the case of a weak lens ($\kappa \ll 1$, $|\gamma| \ll 1$), the shear map is directly obtained from
observations, cf. (5.16). When inserted into (5.4), this map provides a parameter- 
free reconstruction of the surface mass density, apart from an overall additive con- 
stant. The importance of this result is obvious, as it provides us with a novel and 
simple method to infer the mass distribution in galaxy clusters. 

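There are two basic ways to apply (5.4) to observational data. Either, one can derive
a shear map from averaging over galaxy images by calculating the local shear on
a grid in $\vec\theta$-space, as described in Sect. 4.3; or, one can replace the integral in (5.4)
by a sum over galaxy images at positions $\vec\theta_i$,

$$ \kappa(\vec\theta) = \frac{1}{n\pi} \sum_i \Re\left[ \mathcal{D}^*(\vec\theta - \vec\theta_i)\, \epsilon_i \right] , \tag{5.5} $$

where $n$ is the number density of galaxy images.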



Unfortunately, this estimate of $\kappa$ has infinite noise (Kaiser & Squires 1993) because
of the noisy sampling of the shear at the discrete background galaxy positions.
Smoothing is therefore necessary to obtain estimators of $\kappa$ with finite noise. The
form of eq. (5.5) is preserved by smoothing, but the kernel $\mathcal{D}$ is modified to another
kernel $\tilde{\mathcal{D}}$. In particular, Gaussian smoothing with smoothing length $\theta_s$ leads to

$$ \tilde{\mathcal{D}}(\vec\theta) = \left[ 1 - \left( 1 + \frac{|\vec\theta|^2}{\theta_s^2} \right) \exp\left( -\frac{|\vec\theta|^2}{\theta_s^2} \right) \right] \mathcal{D}(\vec\theta) \tag{5.6} $$

(Seitz & Schneider 1995a). The rms error of the resulting $\kappa$ map is of order
$\sigma_\epsilon N^{-1/2}$, where $N$ is the number of galaxy images per smoothing window,
$N \sim n\pi\theta_s^2$. However, the errors will be strongly spatially correlated. Whereas the
estimate (5.5) with $\mathcal{D}$ replaced by $\tilde{\mathcal{D}}$ uses the observational data more directly than by
first constructing a smoothed shear map and applying (5.4) to it, it turns out that the
latter method yields a mass map which is less noisy than the estimate obtained from
(5.5), because (5.5) contains the 'shot noise' from the random angular position of
the galaxy images (Seitz & Schneider 1995a).
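The direct-sum estimator with the smoothed kernel is short enough to write out. The sketch assumes that the kernel of (3.15) has the standard form $\mathcal{D}(\vec\theta) = -(\theta_1 + {\rm i}\theta_2)^2/|\vec\theta|^4$ (not restated in this section), represents positions as complex numbers $\theta_1 + {\rm i}\theta_2$, and uses hypothetical function names.

```python
import numpy as np

def smoothed_kernel(dtheta, theta_s):
    """Eq. (5.6) for complex separations dtheta = theta_1 + i theta_2."""
    dtheta = np.asarray(dtheta, dtype=complex)
    r2 = np.abs(dtheta) ** 2
    x = r2 / theta_s ** 2
    window = 1.0 - (1.0 + x) * np.exp(-x)
    safe_r2 = np.where(r2 > 0, r2, 1.0)
    d = -(dtheta ** 2) / safe_r2 ** 2      # assumed form of D(theta) from (3.15)
    return np.where(r2 > 0, window * d, 0.0)

def kappa_direct(theta, theta_gal, eps_gal, n_density, theta_s):
    """Eq. (5.5) with D replaced by the smoothed kernel of eq. (5.6)."""
    k = smoothed_kernel(theta - theta_gal, theta_s)
    return np.sum((np.conj(k) * eps_gal).real) / (n_density * np.pi)
```

The window factor behaves as $|\vec\theta|^4/(2\theta_s^4)$ at small separations, which removes the $1/|\vec\theta|^2$ divergence of $\mathcal{D}$ responsible for the infinite noise of the unsmoothed sum.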



A lower bound to the smoothing length $\theta_s$ follows from the spatial number density
of background galaxies, i.e. their mean separation. More realistically, a smoothing
window needs to encompass several galaxies. In regions of strong shear signals,
$N \sim 10$ may suffice, whereas mass maps in the outskirts of clusters where the shear
is small may be dominated by noise unless $N \sim 100$. These remarks illustrate that
a single smoothing scale across a whole cluster may be a poor choice. We shall
return to this issue in Sect. 5.5.1, where improvements will be discussed.
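As a rough worked example of these numbers, assume (purely for illustration; neither value is taken from the text) a background galaxy density $n = 30\,{\rm arcmin}^{-2}$ and an ellipticity dispersion $\sigma_\epsilon = 0.3$. With $N \sim n\pi\theta_s^2$, the smoothing lengths and the corresponding noise levels $\sigma_\epsilon N^{-1/2}$ are

$$ \theta_s \simeq \sqrt{\frac{N}{\pi n}} \approx 0.33' \quad (N = 10) , \qquad \theta_s \approx 1.0' \quad (N = 100) , $$

$$ \sigma_\epsilon N^{-1/2} \approx 0.095 \quad (N = 10) , \qquad \sigma_\epsilon N^{-1/2} \approx 0.03 \quad (N = 100) . $$

Under these assumptions, a window adequate for a shear of order 0.1 or more near the cluster centre is about three times smaller than the one needed to reach a noise level of a few per cent in the outskirts.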

Before applying the mass reconstruction formula (5.4) to real data, one should be 
aware of the following difficulties: 

(1) The integral in (5.4) extends over $\mathbb{R}^2$, while real data fields are relatively small
    (most of the applications shown in Sect. 5.4 are based on CCDs with side
    lengths of about 7 arc min). Since there is no information on the shear outside
    the data field, the integration has to be restricted to the field, which is equivalent
    to setting $\gamma = 0$ outside. This is done explicitly in (5.5). This cut-off in the
    integration leads to boundary artefacts in the mass reconstruction. Depending
    on the strength of the lens, its angular size relative to that of the data field, and
    its location within the data field, these boundary artefacts can be more or less
    severe. They are less important if the cluster is weak, small compared to the
    data field, and centred on it.

(2) The shear is an approximate observable only in the limit of weak lensing. The
    surface mass density obtained by (5.4) is biased low in the central region of
    the cluster where the weak lensing assumption may not hold (and does not
    hold in those clusters which show giant arcs). Thus, if the inversion method is
    to be applied also to the inner parts of a cluster, the relation between $\gamma$ and the
    observable $\delta$ has to be taken into account.

(3) The surface mass density is determined by (5.4) only up to an additive constant.
    We demonstrate in the next subsection that there exists a slightly different
    general invariance transformation which is present in all mass reconstructions
    based solely on image shapes. However, this invariance transformation
    can be broken by including the magnification effect.

In the next three subsections, we shall consider points (1) and (2). In particular, we 
show that the first two problems can easily be cured. The magnification effects will 
be treated in Sect. 5.4. 



5.2.2 Non-Linear Generalisation of the Inversion, and an Invariance Transformation

In this section, we generalise the inversion equation (5.4) to also account for strong
lensing, i.e. we shall drop the assumption $\kappa \ll 1$ and $|\gamma| \ll 1$. In this case, the shear
$\gamma$ is no longer a direct observable, but at best the reduced shear $g$, or in general the
distortion $\delta$, and the relation between $\kappa$ and the observable becomes non-linear.
Furthermore, we shall assume here that all sources are at the same redshift,
so that the reduced shear is well-defined.

Consider first the case that the cluster is sub-critical everywhere, i.e. det A > 0 for
all θ, which implies |g(θ)| < 1. Then, the mean image ellipticity ε is an unbiased
estimate of the local reduced shear, so that

$$ \gamma(\vec\theta) = [1-\kappa(\vec\theta)]\,\langle\epsilon\rangle(\vec\theta) , \tag{5.7} $$

where the field ⟨ε⟩(θ) is determined by the local averaging procedure described in
Sect. 4.3.1. Inserting this into (5.4) leads to an integral equation for κ(θ),

$$ \kappa(\vec\theta) - \kappa_0 = \frac{1}{\pi}\int_{\mathbb{R}^2} d^2\theta'\, [1-\kappa(\vec\theta')]\, \Re\left[\mathcal{D}^*(\vec\theta-\vec\theta')\,\langle\epsilon\rangle(\vec\theta')\right] \tag{5.8} $$

(Seitz & Schneider 1995a), which is readily solved by iteration. Starting from κ = 0,
a first estimate of κ(θ) is obtained from (5.8), which after insertion into the
right-hand side of (5.7) yields an update of γ(θ), etc. This iteration process converges
quickly to the unique solution.
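
The iteration just described can be sketched numerically. The following Python example is only an illustration under stated assumptions, not the implementation of Seitz & Schneider (1995a): it assumes a periodic grid so that the linear inversion (5.4) can be evaluated with FFTs, it takes the reduced shear g on the grid in place of the smoothed ellipticity field ⟨ε⟩, and it assumes a sub-critical lens. All function names are hypothetical, and the parameter `kappa0` plays the role of κ₀ (here the mean of κ over the grid).

```python
import numpy as np


def _wavenumbers(shape):
    """Grid wavenumbers; k^2 at the zero mode is set to 1 to avoid 0/0."""
    k1 = np.fft.fftfreq(shape[1])[None, :]  # along theta_1 (array axis 1)
    k2 = np.fft.fftfreq(shape[0])[:, None]  # along theta_2 (array axis 0)
    k_sq = k1**2 + k2**2
    k_sq[0, 0] = 1.0
    return k1, k2, k_sq


def shear_from_kappa(kappa):
    """Forward relation kappa -> (gamma_1, gamma_2) on a periodic grid."""
    k1, k2, k_sq = _wavenumbers(kappa.shape)
    kappa_hat = np.fft.fft2(kappa)
    gamma1 = np.fft.ifft2((k1**2 - k2**2) / k_sq * kappa_hat).real
    gamma2 = np.fft.ifft2(2.0 * k1 * k2 / k_sq * kappa_hat).real
    return gamma1, gamma2


def kappa_from_shear(gamma1, gamma2):
    """Linear inversion in Fourier space; returns kappa minus its grid mean."""
    k1, k2, k_sq = _wavenumbers(gamma1.shape)
    kappa_hat = ((k1**2 - k2**2) * np.fft.fft2(gamma1)
                 + 2.0 * k1 * k2 * np.fft.fft2(gamma2)) / k_sq
    return np.fft.ifft2(kappa_hat).real


def nonlinear_inversion(g1, g2, kappa0=0.0, n_iter=30):
    """Alternate between (5.7) and the linear inversion, starting from kappa = 0."""
    kappa = np.zeros_like(g1)
    for _ in range(n_iter):
        gamma1 = (1.0 - kappa) * g1  # eq. (5.7) with <eps> replaced by g
        gamma2 = (1.0 - kappa) * g2
        kappa = kappa0 + kappa_from_shear(gamma1, gamma2)
    return kappa
```

With κ = 0 as the starting guess, the first pass is the linear inversion applied to g; later passes apply the factor [1 − κ] of (5.7). Different choices of `kappa0` give different solutions, which is the freedom discussed in the remainder of this subsection.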

The situation becomes only slightly more complicated if critical clusters are included.
We only need to keep track of det A while iterating, because γ must be
derived from 1/⟨ε⟩* rather than from ⟨ε⟩ where det A < 0. Hence, the local invariance
between g and 1/g* is broken due to non-local effects: A local jump from g
to 1/g* cannot be generated by any smooth surface mass density.

After a minor modification⁹, this iteration process converges quickly. See
Seitz & Schneider (1995a) for more details on this method and for numerical tests
done with a cluster mass distribution produced by a cosmological N-body simulation.
It should have become clear that the non-linear inversion process poses hardly
any additional problem to the mass reconstruction compared to the linear inversion
(5.4).

⁹ At points where κ = 1, 1/g* = 0 and E(ε) = 0, while γ remains finite. During the
iteration, there will be points θ where the field κ is very close to unity, but where
⟨ε⟩ is not necessarily small. This leads to large values of γ, which render the
iteration unstable. However, this instability can easily be removed if a damping factor like
(1 + |γ²(θ')|) exp(−|γ²(θ')|) is included in (5.4). This modification leads to fast
convergence and affects the result of the iteration only very slightly.

This non-linear inversion still contains the constant κ₀, and so the result will depend
on this unconstrained constant. However, in contrast to the linear (weak lensing)
case, this constant does not correspond to adding a sheet of constant surface mass
density. In fact, as can be seen from (5.8), the transformation

$$ \kappa(\vec\theta) \to \kappa'(\vec\theta) = \lambda\,\kappa(\vec\theta) + (1-\lambda) \quad\text{or}\quad [1-\kappa'(\vec\theta)] = \lambda\,[1-\kappa(\vec\theta)] \tag{5.9} $$

leads to another solution of the inverse problem for any value of λ ≠ 0. Another and
more general way to see this is that the transformation κ → κ' changes γ to γ'(θ) =
λγ(θ), cf. (3.15, page 50). Hence, the reduced shear g = γ(1 − κ)⁻¹ is invariant
under the transformation (5.9), so that the relation between intrinsic and observed
ellipticity is unchanged under the invariance transformation (5.9). This is the
mass-sheet degeneracy pointed out by Falco et al. (1985) in a different context. We thus
conclude that the degeneracy due to the invariance transformation (5.9) cannot be
lifted if only image shapes are used. However, the magnification transforms like

$$ \mu'(\vec\theta) = \lambda^{-2}\,\mu(\vec\theta) , \tag{5.10} $$

so that the degeneracy can be lifted if magnification effects are taken into account
(see Sect. 4.4).

The invariance transformation leaves the critical curves of the lens mapping invariant.
Therefore, even the location of giant luminous arcs which roughly trace
the critical curves does not determine the scaling constant λ. In addition, the curve
κ = 1 is invariant under (5.9). However, there are at least two ways to constrain λ.
First, it is reasonable to expect that on the whole the surface mass density in clusters
decreases with increasing separation from the cluster 'centre', so that λ > 0.
Second, since the surface mass density κ is non-negative, upper limits on λ are
obtained by enforcing this condition.



5.2.3 Finite-Field Inversion Techniques

We shall now turn to the problem that the inversion (5.4) in principle requires data
on the whole sky, whereas the available data field is finite. A simple solution of this
problem has been attempted by Seitz & Schneider (1995a). They extrapolated the
measured shear field on the finite region 𝒰 outside the data field, using a parameterised
form for the radial decrease of the shear. From a sample of numerically
generated cluster mass profiles, Bartelmann (1995a) showed that this extrapolation
yields fairly accurate mass distributions. However, in these studies the cluster was
always assumed to be isolated and placed close to the centre of the data field. If
these two conditions are not met, the extrapolation can produce results which are
significantly off. In order to remove the boundary artefacts inherent in applying
(5.4) to a finite field, one should therefore aim at constructing an unbiased
finite-field inversion method.



The basis of most finite-field inversions is a result first derived by Kaiser (1995).
Equation (3.12, page 49) shows that shear and surface mass density are both given
as second partial derivatives of the deflection potential ψ. After partially
differentiating (3.12, page 49) and combining suitable terms we find

$$ \nabla\kappa = \begin{pmatrix} \gamma_{1,1}+\gamma_{2,2} \\ \gamma_{2,1}-\gamma_{1,2} \end{pmatrix} \equiv \vec u_\gamma(\vec\theta) . \tag{5.11} $$

The gradient of the surface mass density can thus be expressed by the first derivatives
of the shear, hence κ(θ) can be determined, up to an additive constant, by integrating
(5.11) along appropriately selected curves. This can be done in the weak
lensing case where the observed smoothed ellipticity field ⟨ε⟩(θ) can be identified
with γ, and u_γ(θ) can be constructed by finite differencing. If we insert γ = (1 − κ) g
into (5.11), we find after some manipulations

$$ \nabla K(\vec\theta) = \frac{-1}{1-g_1^2-g_2^2} \begin{pmatrix} 1-g_1 & -g_2 \\ -g_2 & 1+g_1 \end{pmatrix} \begin{pmatrix} g_{1,1}+g_{2,2} \\ g_{2,1}-g_{1,2} \end{pmatrix} \equiv \vec u_g(\vec\theta) , \tag{5.12} $$

where

$$ K(\vec\theta) \equiv \ln[1-\kappa(\vec\theta)] . \tag{5.13} $$

Hence, using the smoothed ellipticity field ⟨ε⟩(θ) as an unbiased estimator for g(θ),
and assuming a sub-critical cluster, one can obtain the vector field u_g(θ) by finite
differencing, and thus determine K(θ) up to an additive constant from line integration,
or, equivalently, 1 − κ(θ) up to an overall multiplicative constant. This is again
the invariance transformation (5.9).
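
The relations (5.11) and (5.12), and their behaviour under (5.9), can be checked numerically. The sketch below is a toy illustration, not a method from the cited papers: it assumes a periodic grid with unit pixel spacing, uses central finite differences, and generates test shear from a known κ with an FFT forward model. All function names are hypothetical.

```python
import numpy as np


def ddx(f):
    """Periodic central difference along theta_1 (array axis 1), unit spacing."""
    return 0.5 * (np.roll(f, -1, axis=1) - np.roll(f, 1, axis=1))


def ddy(f):
    """Periodic central difference along theta_2 (array axis 0), unit spacing."""
    return 0.5 * (np.roll(f, -1, axis=0) - np.roll(f, 1, axis=0))


def u_gamma(gamma1, gamma2):
    """Right-hand side of (5.11), built by finite differencing the shear."""
    return ddx(gamma1) + ddy(gamma2), ddx(gamma2) - ddy(gamma1)


def u_g(g1, g2):
    """Right-hand side of (5.12), built by finite differencing the reduced shear."""
    a = ddx(g1) + ddy(g2)
    b = ddx(g2) - ddy(g1)
    prefactor = -1.0 / (1.0 - g1**2 - g2**2)
    return (prefactor * ((1.0 - g1) * a - g2 * b),
            prefactor * (-g2 * a + (1.0 + g1) * b))


def shear_from_kappa(kappa):
    """Toy forward model on a periodic grid (FFT), used only to make test data."""
    k1 = np.fft.fftfreq(kappa.shape[1])[None, :]
    k2 = np.fft.fftfreq(kappa.shape[0])[:, None]
    k_sq = k1**2 + k2**2
    k_sq[0, 0] = 1.0  # avoid 0/0 at the zero mode
    kappa_hat = np.fft.fft2(kappa)
    return (np.fft.ifft2((k1**2 - k2**2) / k_sq * kappa_hat).real,
            np.fft.ifft2(2.0 * k1 * k2 / k_sq * kappa_hat).real)


def mass_sheet(kappa, gamma1, gamma2, lam):
    """Transformation (5.9) together with gamma -> lam * gamma."""
    return lam * kappa + (1.0 - lam), lam * gamma1, lam * gamma2


def magnification(kappa, gamma1, gamma2):
    """mu = 1 / [(1 - kappa)^2 - |gamma|^2]."""
    return 1.0 / ((1.0 - kappa)**2 - gamma1**2 - gamma2**2)
```

For a smooth test κ, `u_gamma` should agree with the finite-difference gradient of κ, and `u_g` with that of ln(1 − κ), up to discretisation error. After `mass_sheet`, the reduced shear (and hence `u_g`) is unchanged, while `magnification` is rescaled by λ⁻², as in (5.10).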

In principle, it is now straightforward to obtain κ(θ) from the vector field u_γ(θ), or
K(θ) from u_g(θ), simply by a line integration of the type

$$ \kappa(\vec\theta,\vec\theta_0) = \kappa(\vec\theta_0) + \int_{\vec\theta_0}^{\vec\theta} d\vec l\cdot\vec u_\gamma(\vec l) , \tag{5.14} $$

where l is a smooth curve connecting θ with θ₀. If u_γ is a gradient field, as it ideally
is, the resulting surface mass density is independent of the choice of the curves l.
However, since u_γ is obtained from noisy data (at least the noise resulting from the
intrinsic ellipticity distribution), it will in general not be a gradient field, so that
(5.11) has no solution. Therefore, the various line integration schemes proposed
(Schneider 1995, Kaiser et al. 1994, Bartelmann 1995a) yield different results.
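
The path dependence of (5.14) for noisy data is easy to demonstrate. The sketch below is a toy example and does not reproduce any of the integration schemes cited above: it assumes that u_γ is given as increments of κ between neighbouring pixels (`ux` along rows, `uy` along columns, both hypothetical names) and compares two simple integration paths starting from the same corner pixel.

```python
import numpy as np


def integrate_x_then_y(ux, uy):
    """Integrate along the first row, then up every column, from pixel (0, 0)."""
    ny, nx = uy.shape[0] + 1, uy.shape[1]
    kappa = np.zeros((ny, nx))
    kappa[0, 1:] = np.cumsum(ux[0])
    kappa[1:, :] = kappa[0] + np.cumsum(uy, axis=0)
    return kappa


def integrate_y_then_x(ux, uy):
    """Integrate up the first column, then along every row, from pixel (0, 0)."""
    ny, nx = ux.shape[0], ux.shape[1] + 1
    kappa = np.zeros((ny, nx))
    kappa[1:, 0] = np.cumsum(uy[:, 0])
    kappa[:, 1:] = kappa[:, :1] + np.cumsum(ux, axis=1)
    return kappa


def increments(kappa):
    """Noise-free increments of kappa between neighbouring pixels."""
    return np.diff(kappa, axis=1), np.diff(kappa, axis=0)
```

For noise-free increments both paths return κ − κ(θ₀). Once independent noise is added to `ux` and `uy`, the field is no longer a gradient field and the two results differ, by an amount that grows with the path length.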

Realising that eq. (5.11) has no exact solution for an observed field u_γ, we wish to
find a mass distribution κ(θ) which satisfies (5.11) 'best'. In general, u_γ can be split
into a gradient field and a curl component, but this decomposition is not unique.
However, as pointed out in Seitz & Schneider (1996), since the curl component
is due to noise, its mean over the data field is expected to vanish. Imposing this
condition, which determines the decomposition uniquely, they showed that

$$ \kappa(\vec\theta) - \bar\kappa = \int_{\mathcal U} d^2\theta'\, \vec H(\vec\theta',\vec\theta)\cdot\vec u_\gamma(\vec\theta') , \tag{5.15} $$

where κ̄ is the average of κ(θ) over the data field 𝒰, and the kernel H is the gradient
of a scalar function which is determined through a Neumann boundary value problem
with a singular source term. This problem can be solved analytically
for circular and rectangular data fields, as detailed in the Appendix of
Seitz & Schneider (1996). If the data field has a more complicated geometry, an
analytic solution is no longer possible, and the boundary value problem with a
singular source term cannot be solved numerically.

An alternative method starts with taking the divergence of (5.11) and leads to the
new boundary value problem,

$$ \nabla^2\kappa = \nabla\cdot\vec u_\gamma \quad\text{with}\quad \vec n\cdot\nabla\kappa = \vec n\cdot\vec u_\gamma \ \text{on}\ \partial\mathcal U , \tag{5.16} $$

where n is the outward-directed normal on the boundary of 𝒰. As shown in
Seitz & Schneider (1998), eqs. (5.15) and (5.16) are equivalent. An alternative and
very elegant way to derive (5.16) has been found by Lombardi & Bertin (1998b).
They noticed that the 'best' approximation to a solution of (5.11) minimises the
'action'

$$ \int_{\mathcal U} d^2\theta\, \left|\nabla\kappa(\vec\theta) - \vec u_\gamma(\vec\theta)\right|^2 . \tag{5.17} $$

Euler's equations of the variational principle immediately reproduce (5.16). This
Neumann boundary problem is readily solved numerically, using standard numerical
techniques (see Sect. 19.5 of Press et al. 1986).

A comparison between these different finite-field inversion equations was performed
in Seitz & Schneider (1996) and in Squires & Kaiser (1996) by numerical
simulations. Of all the inversions tested, the inversion (5.16) performs best
on all scales (Seitz & Schneider 1996; Fig. 6 of Squires & Kaiser 1996). Indeed,
Lombardi & Bertin (1998b) showed analytically that the solution of eq. (5.16) provides
the best unbiased estimate of the surface mass density. The relations (5.14)
through (5.17) can be generalised to the non-weak case by replacing κ with K and
u_γ with u_g.
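
A discrete analogue of the variational principle (5.17) can be written in a few lines. The sketch below is one possible illustration, not the scheme used in the cited papers: it assumes that u_γ is supplied as increments of κ between neighbouring pixels (`ux`, `uy`, hypothetical names), builds the pixel-difference operator as a dense matrix, and minimises the sum of squared residuals with `numpy.linalg.lstsq`. The normal equations of this least-squares problem are a discrete Poisson equation with boundary conditions of the type in (5.16). A dense matrix is only practical for small grids.

```python
import numpy as np


def least_squares_kappa(ux, uy):
    """Minimise sum |grad(kappa) - u|^2 on a rectangular pixel grid.

    ux[i, j] is the increment of kappa from pixel (i, j) to (i, j + 1),
    uy[i, j] the increment from pixel (i, j) to (i + 1, j).
    Returns kappa minus its mean over the field (cf. eq. 5.15).
    """
    ny, nx = ux.shape[0], uy.shape[1]
    index = np.arange(ny * nx).reshape(ny, nx)
    # Each pair of neighbouring pixels contributes one equation head - tail = u.
    tail = np.concatenate([index[:, :-1].ravel(), index[:-1, :].ravel()])
    head = np.concatenate([index[:, 1:].ravel(), index[1:, :].ravel()])
    rows = np.arange(tail.size)
    grad = np.zeros((tail.size, ny * nx))
    grad[rows, tail] = -1.0
    grad[rows, head] = 1.0
    rhs = np.concatenate([ux.ravel(), uy.ravel()])
    kappa, *_ = np.linalg.lstsq(grad, rhs, rcond=None)
    kappa = kappa.reshape(ny, nx)
    return kappa - kappa.mean()  # the additive constant is unconstrained
```

For a noise-free gradient field the function returns κ − κ̄ exactly, wherever the mass concentration lies in the field. With noisy input it still returns a unique, zero-mean solution, in contrast to line integration along individual paths.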

5.2.4 Accounting for a redshift distribution of the sources 

We now describe how the preceding mass reconstructions must be modified if the
sources have a broad redshift distribution. In fact, only minor modifications are
needed. The relation ⟨ε⟩ = g for a single source redshift is replaced by eq. (4.28),
which gives an estimate for the shear in terms of the mean image ellipticities and
the surface mass density. This relation can be applied iteratively:

Begin with κ^(0) = 0; then, eq. (4.28) yields a first guess for the shear γ^(1)(θ) by
setting γ = 0 on the right-hand side. From (5.15), or equivalently by solving (5.16),
the corresponding surface mass density κ^(1)(θ) is obtained. Inserting κ^(1) and γ^(1)
on the right-hand side of eq. (4.28), a new estimate γ^(2)(θ) for the shear is obtained,
and so forth.

This iteration process quickly converges. Indeed, the difficulty mentioned in
footnote 9 (page 90) no longer occurs since the critical curves and the curve(s) κ = 1 are
effectively smeared out by the redshift distribution, and so the iteration converges
even faster than in the case of a single source redshift.

Since κ^(n) is determined only up to an additive constant for any γ^(n), the solution
of the iteration depends on the choice of this constant. Hence, one can obtain a
one-parameter family of mass reconstructions, like in (5.9). However, the resulting
mass-sheet degeneracy can no longer be expressed analytically due to the complex
dependence of (4.28) on κ and γ. In the case of weak lensing, it corresponds to
adding a constant, as before. An approximate invariance transformation can also
be obtained explicitly for mildly non-linear clusters with κ ≲ 0.7 and det A > 0
everywhere. In that case, eq. (4.29) holds approximately, and can be used to show
(Seitz & Schneider 1997) that the invariance transformation takes the form

$$ \kappa(\vec\theta) \to \kappa'(\vec\theta) = \lambda\,\kappa(\vec\theta) + \frac{(1-\lambda)\,\langle w\rangle}{\langle w^2\rangle} . \tag{5.18} $$

In case of a single redshift z_s, such that w(z_s) = ⟨w⟩, this transformation reduces to
(5.9) for ⟨w⟩κ.
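
The invariance under (5.18) can be verified with a short numerical check. The sketch below rests on two assumptions that are not restated in this section: that eq. (4.29) has the form ⟨ε⟩ ≈ ⟨w⟩γ / [1 − (⟨w²⟩/⟨w⟩)κ], and that the shear transforms as γ → λγ, as in the single-redshift case. The function names and the numerical values of the moments of w are hypothetical.

```python
import numpy as np


def mean_ellipticity_approx(kappa, gamma, w_mean, w2_mean):
    """Assumed form of eq. (4.29): <eps> ~ <w> gamma / (1 - <w^2>/<w> kappa)."""
    return w_mean * gamma / (1.0 - (w2_mean / w_mean) * kappa)


def transform_5_18(kappa, gamma, lam, w_mean, w2_mean):
    """Transformation (5.18), with gamma -> lam * gamma (an assumption here)."""
    return lam * kappa + (1.0 - lam) * w_mean / w2_mean, lam * gamma
```

Under these assumptions the approximate mean ellipticity is unchanged by `transform_5_18` for any λ ≠ 0. Setting `w2_mean = w_mean**2` corresponds to a single source redshift and reproduces (5.9) for ⟨w⟩κ.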

We point out that the invariance transformation (5.18) in the case of a redshift
distribution of sources is of a different nature than that for a single source redshift. In
the latter case, the reduced shear g(θ) is invariant under the transformation (5.9).
Therefore, the probability distribution of the observed galaxy ellipticities is invariant,
since it involves only the intrinsic ellipticity distribution and g. For a redshift
distribution, the invariance transformation keeps the mean image ellipticities invariant,
but the probability distributions are changed. Several strategies were explored
in Seitz & Schneider (1997) to utilise this fact for breaking the invariance
transformation. While possible in principle, the corresponding effect on the observed
ellipticity distribution is too small for this approach to be feasible with existing
data.



5.2.5 Breaking the Mass-Sheet Degeneracy 

Equation (5.10) shows that the invariance transformation (5.9) affects the magnification.
Hence, the degeneracy can be lifted with magnification information. As
discussed in Sect. 4.4, two methods to obtain magnification information have been
proposed. Detections of the number-density effect have so far been reported for two
clusters (Cl 0024+16, Fort et al. 1997; Abell 1689, Taylor et al. 1998). Although the
number-density effect provides less efficient information than shear measurements
(see Sect. 4.4.3), these two clusters appear to be massive enough to allow
a significant detection. In fact, Taylor et al. (1998) obtained a two-dimensional
mass reconstruction of the cluster A 1689 from magnification data.

In the case of weak lensing, and thus small magnifications, the magnification can
locally be translated into a surface mass density - see (4.44). In general, the relation
between μ and κ is non-local, since μ also depends on the shear. Various
attempts to account for this non-locality have been published (van Kampen 1998,
Dye & Taylor 1998). However, it must be noted that the surface mass density cannot
be obtained from magnification alone since the magnification also depends on
the shear caused by matter outside the data field. In practice, if the data field is
sufficiently large and no mass concentration lies close to but outside the data field,
the mass reconstruction obtained from magnification can be quite accurate.

In order to break the mass-sheet degeneracy, it suffices in principle to measure one
value of the magnification: either the magnification at one location in the cluster,
or the average magnification over a region. We shall see later in Sect. 5.5.1 how
local magnification information can be combined with shear measurements. Doing
it the naive way, expressing κ in terms of μ and γ, is a big waste of information:
Since there is only one independent scalar field (namely the deflection potential ψ)
describing the lens, one can make much better use of the measurements of γ and
μ than just combining them locally; the relation between them should be used to
reduce the error on κ.
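
How a single magnification value fixes λ follows directly from (5.9) and (5.10), and can be sketched as follows. This is only an idealised illustration, assuming a noise-free magnification measurement at one sub-critical point, λ > 0, and a shape-only reconstruction (`kappa_rec`, `gamma_abs_rec`) that differs from the true lens by the transformation (5.9). The function names are hypothetical.

```python
import numpy as np


def magnification(kappa, gamma_abs):
    """mu = 1 / [(1 - kappa)^2 - |gamma|^2]."""
    return 1.0 / ((1.0 - kappa)**2 - gamma_abs**2)


def fix_lambda(kappa_rec, gamma_abs_rec, mu_obs):
    """Scaling lambda > 0 with mu_rec / lambda^2 = mu_obs, from (5.10)."""
    return np.sqrt(magnification(kappa_rec, gamma_abs_rec) / mu_obs)


def apply_mass_sheet(kappa, gamma_abs, lam):
    """Transformation (5.9), with |gamma| -> lam * |gamma|."""
    return lam * kappa + (1.0 - lam), lam * gamma_abs
```

Applying `apply_mass_sheet` with the λ returned by `fix_lambda` rescales the whole reconstruction so that it matches the measured magnification at the chosen point. As the text notes, this local use of magnification discards information; the example only shows that one number is enough to select a member of the one-parameter family.
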
5.3 Aperture Mass and Multipole Measures



Having reconstructed the mass distribution, we can estimate the local dispersion of
κ (e.g., Lombardi & Bertin 1998b). However, the errors at different points will be
strongly correlated, and so it makes little sense to attach an error bar to each point of
the mass map. Although mass maps contain valuable information, it is sometimes
preferable to reduce them to a small set of numbers such as the mass-to-light ratio,
or the correlation coefficient between the mass map and the light distribution. One
of the quantities of interest is the total mass inside a given region. As became clear
in the last section, this quantity by itself cannot be determined from observed image
ellipticities due to the invariance transformation. But a quantity related to it,

$$ \zeta(\vec\theta;\vartheta_1,\vartheta_2) \equiv \bar\kappa(\vec\theta;\vartheta_1) - \bar\kappa(\vec\theta;\vartheta_1,\vartheta_2) , \tag{5.19} $$

the difference between the mean surface mass densities in a circle of radius ϑ₁
around θ and in an annulus of inner and outer radii ϑ₁ and ϑ₂, respectively, can
be determined in the weak-lensing case, since then the invariance transformation
corresponds to an additive constant in κ which drops out of (5.19). We show in this
section that quantities like (5.19) can directly be obtained from the image ellipticities
without the need for a two-dimensional mass map. In Sect. 5.3.1, we derive a
generalised version of (5.19), whereas we consider the determination of mass
multipoles in Sect. 5.3.2. The prime advantage of all these aperture measures is that the
error analysis is relatively straightforward.



5.3.1 Aperture Mass Measures

Generally, aperture mass measures are weighted integrals of the local surface mass
density,

$$ M_\mathrm{ap}(\vec\theta_0) \equiv \int d^2\theta\; \kappa(\vec\theta)\, U(\vec\theta-\vec\theta_0) , \tag{5.20} $$

with weight function U(θ). Assume now that the weight function is constant on
self-similar concentric curves. For example, the ζ-statistics (5.19), introduced by
Kaiser (1995), is of the form (5.20), with a weight function that is constant on
circles, U(ϑ) = (πϑ₁²)⁻¹ for 0 ≤ ϑ ≤ ϑ₁, U(ϑ) = −[π(ϑ₂² − ϑ₁²)]⁻¹ for ϑ₁ < ϑ ≤ ϑ₂,
and zero otherwise.

Let the shape of the aperture be described by a closed curve c(λ), λ ∈ I, where
I is a finite interval, such that c × ċ ≡ c₁ċ₂ − c₂ċ₁ > 0 for all λ ∈ I. We can then
uniquely define a new coordinate system (b, λ) by choosing a centre θ₀ and defining
θ = θ₀ + b c(λ). The weight function should be constant on the curves c(λ) so that
it depends only on b. In the new coordinate system, (5.20) reads

$$ M_\mathrm{ap}(\vec\theta_0) = \int_0^\infty db\; b\, U(b) \oint_I d\lambda\; \vec c\times\dot{\vec c}\;\, \kappa[\vec\theta_0 + b\,\vec c(\lambda)] , \tag{5.21} $$

where the factor b c × ċ is the Jacobian determinant of the coordinate transformation.
Equation (5.21) can now be transformed in three steps; first, by a partial integration
with respect to b; second, by replacing partial derivatives of κ with partial
derivatives of γ using eq. (5.11); and third by removing partial derivatives of γ in
another partial integration. In carrying out these steps, we assume that the weight
function is compensated,

$$ \int_0^\infty db\; b\, U(b) = 0 . \tag{5.22} $$

Introducing

$$ Q(b) \equiv \frac{2}{b^2}\int_0^b db'\; b'\, U(b') - U(b) \tag{5.23} $$

and writing the curve c in complex notation, C(λ) = c₁(λ) + i c₂(λ), leads to the
final result (Schneider & Bartelmann 1997)

$$ M_\mathrm{ap}(\vec\theta_0) = \int d^2\theta\; Q[b(\vec\theta)]\, \frac{\Im[\gamma(\vec\theta)\, C^*\dot C^*]}{\Im[C^*\dot C]} , \tag{5.24} $$

where the argument λ of C is to be evaluated at position θ = θ₀ + b c(λ).¹⁰ The
numerator in the final term of (5.24) projects out a particular component of the shear,
whereas the denominator is part of the Jacobian of the coordinate transformation.
The constraint (5.22) assures that an additive constant in κ does not affect M_ap. The
expression (5.24) has several nice properties which render it useful:

(1) If the function U(b) is chosen such that it vanishes for b > b₂, then from (5.22)
and (5.23), Q(b) = 0 for b > b₂. Thus, the aperture mass can be derived from
the shear in a finite region.

(2) If U(b) = const for 0 ≤ b ≤ b₁, then Q(b) = 0 in that interval. This means
that the aperture mass can be determined solely from the shear in an annulus
b₁ < b < b₂. This has two advantages which are relevant in practice. First,
if the aperture is centred on a cluster, the bright central cluster galaxies may
prevent the detection of a large number of faint background galaxies there,
so that the shear in the central part of the cluster may be difficult to measure.
In that case it is still possible to determine the total mass inside the cluster
core using (5.24) with an appropriately chosen weight function U. Second,
although in general the shear cannot be determined directly from the image
ellipticities [but only the reduced shear γ(1 − κ)⁻¹], we can choose the size
b₁ of the inner boundary of the annulus sufficiently large that κ ≪ 1 in the
annulus, and then γ ≈ g is an accurate approximation. Hence, in that case
the mean image ellipticity directly yields an estimate of the shear. Then, the
integral (5.24) can be transformed into a sum over galaxy images lying in the
annulus, yielding M_ap directly in terms of the observables. This in turn has the
great advantage that an error analysis of M_ap is fairly simple.

¹⁰ There are of course other ways to derive (5.24), e.g. by inserting (5.4) into (5.20). See
Squires & Kaiser (1996) for a different approach using Gauss's law.

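We consider circular apertures as an example, for which (b, λ) = (θ, φ) and C(φ) =
exp(iφ). Then, ℑ(C*Ċ) = 1, and

$$ \Im(\gamma\, C^*\dot C^*) = \gamma_\mathrm{t}(\vec\theta;\vec\theta_0) := -[\gamma_1\cos(2\varphi) + \gamma_2\sin(2\varphi)] = -\Re[\gamma(\vec\theta+\vec\theta_0)\, e^{-2i\varphi}] , \tag{5.25} $$

where we have defined the tangential component γ_t of the shear relative to the point
θ₀. Hence, for circular apertures (5.24) becomes

$$ M_\mathrm{ap}(\vec\theta_0) = \int d^2\theta\; Q(|\vec\theta|)\, \gamma_\mathrm{t}(\vec\theta;\vec\theta_0) \tag{5.26} $$

(Kaiser et al. 1994; Schneider 1996b). The ζ-statistics (5.19) is obtained from
(5.26) by setting Q(ϑ) = ϑ₂² ϑ⁻² [π(ϑ₂² − ϑ₁²)]⁻¹ for ϑ₁ ≤ ϑ ≤ ϑ₂ and Q(ϑ) = 0
otherwise, so that

$$ \zeta(\vec\theta_0;\vartheta_1,\vartheta_2) = \frac{\vartheta_2^2}{\pi(\vartheta_2^2-\vartheta_1^2)} \int d^2\theta\; \frac{\gamma_\mathrm{t}(\vec\theta;\vec\theta_0)}{|\vec\theta|^2} , \tag{5.27} $$

where the integral is taken over the annulus ϑ₁ ≤ θ ≤ ϑ₂.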
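
The filter Q belonging to the ζ-statistics can be checked against its definition (5.23). The sketch below is a numerical consistency check, not part of the cited derivations: it evaluates (5.23) for the piecewise-constant weight function of the ζ-statistics with a simple trapezoidal rule and compares the result with the closed form quoted above. Function names and the quadrature resolution are hypothetical choices.

```python
import numpy as np


def u_zeta(b, b1, b2):
    """Weight function of the zeta-statistics: constant on the disc and on the annulus."""
    b = np.asarray(b, dtype=float)
    inner = 1.0 / (np.pi * b1**2)
    outer = -1.0 / (np.pi * (b2**2 - b1**2))
    return np.where(b <= b1, inner, np.where(b <= b2, outer, 0.0))


def q_from_u(b, u_func, n_steps=20001):
    """Q(b) from eq. (5.23), with the integral done by the trapezoidal rule."""
    bp = np.linspace(0.0, b, n_steps)
    f = bp * u_func(bp)
    integral = np.sum(0.5 * (f[1:] + f[:-1]) * np.diff(bp))
    return 2.0 * integral / b**2 - u_func(b)


def q_zeta(b, b1, b2):
    """Closed form: b2^2 / [pi b^2 (b2^2 - b1^2)] in the annulus, zero elsewhere."""
    b = np.asarray(b, dtype=float)
    inside = (b >= b1) & (b <= b2)
    return np.where(inside, b2**2 / (np.pi * b**2 * (b2**2 - b1**2)), 0.0)
```

Inside the inner circle `q_from_u` vanishes, as stated in property (2) above, and in the annulus it agrees with `q_zeta` up to the quadrature error. The same quadrature applied to the integral of b U(b) over the whole aperture confirms that this weight function is compensated in the sense of (5.22).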

For practical purposes, the integral in (5.26) is transformed into a sum over galaxy
images. Recalling that ε is an estimator for γ in the weak-lensing case, and that the
weight function can be chosen to avoid the strong-lensing regime, we can write

$$ M_\mathrm{ap}(\vec\theta_0) = \frac{1}{n}\sum_i Q(|\vec\theta_i-\vec\theta_0|)\,\epsilon_{\mathrm{t}i}(\vec\theta_0) , \tag{5.28} $$

where we have defined, in analogy to γ_t, the tangential component ε_ti of the
ellipticity of an image at θ_i relative to the point θ₀ by

$$ \epsilon_{\mathrm{t}i} = -\Re(\epsilon\, e^{-2i\varphi}) , \tag{5.29} $$

φ is the polar angle of θ − θ₀, and n is the number density of galaxy images. The
rms dispersion σ(M_ap) of M_ap in the case of no lensing is found from the
(two-dimensional) dispersion σ_ε of the intrinsic ellipticity of galaxies,

$$ \sigma(M_\mathrm{ap}) = \frac{\sigma_\epsilon}{2^{1/2}\, n} \left[\sum_i Q^2(|\vec\theta_i-\vec\theta_0|)\right]^{1/2} . \tag{5.30} $$

The rms dispersion in the presence of lensing will deviate only weakly from σ(M_ap)
as long as the assumption of weak lensing in the annulus is satisfied. Hence, σ(M_ap)
can be used as an error estimate for the aperture mass and as an estimate for the
signal-to-noise ratio of a mass measurement.

This opens the interesting possibility to search for (dark) mass concentrations using
the aperture mass (Schneider 1996b). Consider a weight function U with the shape
of a Mexican hat, and a data field 𝒰 on which apertures of angular size θ can be
placed. For each aperture position, one can calculate M_ap and the dispersion. The
dispersion can be obtained either from the analytical formula (5.30), or it can be
obtained directly from the data, by randomising the position angles of all galaxy
images within the aperture. The dispersion can be obtained from many realisations
of this randomisation process. Large values of M_ap will be obtained for mass
concentrations whose characteristic size and shape is close to that of the chosen filter
function U. Thus, by varying the size of the filter, different mass concentrations
will preferentially be selected. The aperture mass is insensitive to mass concentrations
of much smaller and much larger angular scales than the filter size.
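
The estimator (5.28), the analytic dispersion (5.30), and the randomisation of position angles can be sketched together. The example below is an illustration under stated assumptions and not the filter or code of Schneider (1996b): galaxy positions and ellipticities are complex numbers, the filter is the ζ-statistics Q of Sect. 5.3.1 rather than a Mexican-hat filter, and all function names, the number of realisations and the seed are hypothetical.

```python
import numpy as np


def q_zeta(theta, t1, t2):
    """Q of the zeta-statistics: t2^2 / [pi theta^2 (t2^2 - t1^2)] in the annulus."""
    theta = np.asarray(theta, dtype=float)
    q = np.zeros_like(theta)
    inside = (theta >= t1) & (theta <= t2)
    q[inside] = t2**2 / (np.pi * theta[inside]**2 * (t2**2 - t1**2))
    return q


def tangential_ellipticity(pos, eps, centre):
    """eps_t = -Re(eps exp(-2 i phi)), eq. (5.29); phi is the polar angle of pos - centre."""
    phi = np.angle(pos - centre)
    return -np.real(eps * np.exp(-2j * phi))


def aperture_mass(pos, eps, centre, n_density, q_func):
    """Estimator (5.28): sum over galaxy images divided by the number density."""
    q = q_func(np.abs(pos - centre))
    return np.sum(q * tangential_ellipticity(pos, eps, centre)) / n_density


def sigma_map_analytic(pos, centre, n_density, sigma_eps, q_func):
    """Dispersion (5.30) in the absence of lensing."""
    q = q_func(np.abs(pos - centre))
    return sigma_eps / (np.sqrt(2.0) * n_density) * np.sqrt(np.sum(q**2))


def sigma_map_randomised(pos, eps, centre, n_density, q_func, n_real=200, seed=0):
    """Dispersion from realisations with randomised position angles of the images."""
    rng = np.random.default_rng(seed)
    values = []
    for _ in range(n_real):
        angles = rng.uniform(0.0, 2.0 * np.pi, eps.size)
        eps_random = np.abs(eps) * np.exp(1j * angles)
        values.append(aperture_mass(pos, eps_random, centre, n_density, q_func))
    return np.std(values)
```

Dividing `aperture_mass` by either dispersion gives a signal-to-noise ratio for the aperture position. The randomised estimate uses the observed moduli |ε|, which include the lensing signal, so it is expected to be slightly larger than the analytic value when the shear in the annulus is not negligible.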

We have considered in Sect. 4.5 the signal-to-noise ratio for the detection of a
singular isothermal sphere from its weak lensing effect. The estimate (4.54) was
obtained by an optimal weighting scheme for this particular mass distribution. Since
real mass concentrations will deviate from this profile, and also from the assumed
symmetry, the filter function U should have a more generic shape. In that case, the
S/N will have the same functional behaviour as in (4.54), but the prefactor depends
on the exact shape of U. For the filter function used in Schneider (1996b), S/N is
about 25% smaller than in (4.54). Nevertheless, one expects that the aperture-mass
method will be sensitive enough to detect intermediate-redshift haloes with
characteristic velocity dispersions above ∼ 600 km s⁻¹.

This expectation has been verified by numerical simulations, which also contained
larger and smaller scale mass perturbations. In addition, a detailed strong-lensing
investigation of the cluster MS 1512+62 has shown that its velocity dispersion is
very close to ∼ 600 km s⁻¹, and it can be seen from the weak-lensing image
distortion alone with very high significance (Seitz et al. 1998b), supporting the
foregoing quantitative prediction. Thus, this method appears to be a very promising
way to obtain a mass-selected sample of haloes which would be of great
cosmological interest (cf. Reblinsky & Bartelmann 1999). We shall return to this issue in
Sect. 6.7.2.



5.3.2 Aperture Multipole Moments 

Since it is possible to express the weighted mass within an aperture as an integral
over the shear, with the advantage that in the weak lensing regime this
integral can be replaced by a sum over galaxy ellipticities, it is natural to ask
whether a similar result holds for multipole moments of the mass. As shown in
Schneider & Bartelmann (1997), this is indeed possible, and we shall briefly
outline the method and the result.

Consider a circular aperture¹¹ centred on a point θ₀. Let U(|θ|) be a radial weight
function, and define the n-th multipole moment by

$$ Q^{(n)} \equiv \int_0^\infty d\theta\; \theta^{n+1}\, U(\theta) \int_0^{2\pi} d\varphi\; e^{n i\varphi}\, \kappa(\vec\theta_0+\vec\theta) . \tag{5.31} $$
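
The definition (5.31) can be evaluated directly when κ is known on a grid, which is useful for testing. The sketch below is only an illustration of the definition, not of the shear-based estimators derived next: it uses θⁿ e^{inφ} = (θ₁ + iθ₂)ⁿ and d²θ = θ dθ dφ to turn (5.31) into a sum over pixels. The function name, the Cartesian grid, and the Gaussian weight used in the test are hypothetical choices.

```python
import numpy as np


def multipole_moment(kappa, x, y, n, weight):
    """Q^(n) of eq. (5.31) as a pixel sum, with the aperture centred on the origin.

    x, y are 2-d coordinate arrays (as returned by numpy.meshgrid) and
    weight is a function of the radius |theta|.
    """
    z = x + 1j * y  # theta^n exp(i n phi) equals z**n
    pixel_area = (x[0, 1] - x[0, 0]) * (y[1, 0] - y[0, 0])
    return np.sum(z**n * weight(np.abs(z)) * kappa) * pixel_area
```

For a circularly symmetric κ centred on the aperture, all moments with n ≠ 0 vanish. A mass distribution elongated along the θ₁-axis gives a real, positive quadrupole Q^(2).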

This can be replaced by an integral over γ in two ways: (5.31) can be integrated by
parts with respect to φ (for n ≠ 0), or with respect to θ, again utilising (5.11). The
resulting expressions are assumed to contain no boundary terms, which restricts
the choice for the weight function U(θ). The remaining integrals then contain
partial derivatives of κ with respect to φ and θ, respectively. Writing (5.11) in polar
coordinates, these partial derivatives can be replaced by partial derivatives of the
shear components with respect to φ and θ. Integrating those by parts with respect
to the appropriate coordinate, and enforcing vanishing boundary terms, we find two
different expressions for the Q^(n):

$$ Q^{(n)}_{\varphi,\theta} = \int d^2\theta\; q^{(n)}_{\varphi,\theta}(\vec\theta)\; \gamma(\vec\theta_0+\vec\theta) . \tag{5.32} $$

The two expressions for q^(n) are formally very different, although it can be shown
that the resulting two expressions for Q^(n) are equivalent. The two very different
equations for the same result are due to the fact that the two components of the shear
γ are not mutually independent, which was not used in the derivation of (5.32).

We now have substantial freedom to choose the weight function and to select one
of the two expressions for Q^(n), or even to take a linear combination of them. We
note the following interesting examples:

(1) The weight function U(θ) can be chosen to vanish outside an annulus, to be
piece-wise differentiable, and to be zero on the inner and outer boundary of
the annulus. The Q^(n) for n ≠ 0 can then be expressed as integrals of the shear
over the annulus, with no further restrictions on U. In particular, U(θ) does
not need to be a compensated weight function.

(2) U(θ) can be a piece-wise differentiable weight function which is constant for
θ ≤ θ₁, and decreases smoothly to zero at θ = θ₂ > θ₁. Again, Q^(n) for n ≠ 0
can be expressed as an integral of the shear in the annulus θ₁ ≤ θ ≤ θ₂. Hence,
as for the aperture mass, multipole moments in the inner circle can be probed
with the shear in the surrounding annulus.

(3) One can choose, for n > 2, a piece-wise differentiable weight function U(θ)
which behaves like θ^(−2n) for θ > θ₂ and decreases to zero at θ = θ₁ < θ₂.
In that case, the multipole moments of the matter outside an annulus can be
probed with data inside the annulus.

¹¹ The method is not restricted to circular apertures, but this case will be most relevant for
measuring multipole moments.

For practical applications, the integral in (5.32) is replaced by a sum over galaxy 
ellipticities. The dispersion of this sum is easily obtained in the absence of lensing, 
with an expression analogous to (5.30). Therefore, the signal-to-noise ratio for the 
multipole moments is easily defined, and thus also the significance of a multipole- 
moment detection. 



5.4 Application to Observed Clusters 

Soon after the parameter-free two-dimensional mass reconstruction was suggested 
by Kaiser & Squires (1993), their method was applied to the cluster MS 1224 
(Fahlman et al. 1994). Since then, several groups have used it to infer the mass pro- 
files of clusters. In parallel to this, several methods have been developed to measure 
the shear from CCD data, accounting for PSF smearing and anisotropy, image dis- 
tortion by the telescope, noise, blending etc. - see the discussion in Sect. 4.6. We 
will now summarise and discuss several of these observational results. 

Tyson et al. (1990) made the first attempt to constrain the mass distribution of a cluster
from a weak-lensing analysis. They discovered a statistically significant tangential
alignment of faint galaxy images relative to the centre of the clusters A 1689 and
Cl 1409+52. Their "lens distortion map" obtained from the image ellipticities yields an
estimate of the mass distribution in these clusters. A detailed analysis of their method is
given in Kaiser & Squires (1993). From a comparison with numerical simulations,
Tyson et al. showed that the best isothermal sphere model for the clusters has a typical
velocity dispersion of σ_v ∼ 1300 ± 200 km s⁻¹ for both clusters. In particular,
their analysis showed that diffuse dark matter in the cluster centres is needed to
account for the observed image distortions.

The inversion method developed by Kaiser & Squires (1993) provided a systematic
approach to reconstruct the mass distribution in clusters. It was first applied to
the cluster MS 1224+20 (Fahlman et al. 1994) at redshift z_d = 0.33, which had
been selected for its high X-ray luminosity. Their square data field with side-length
∼ 14′ was composed of several exposures, most of them with excellent seeing.
They estimated the shear from image ellipticities, corrected for the PSF anisotropy,
and applied a correction factor f as defined in Sect. 4.6.1. They found f ∼ 1.5 in
simulations, in very good agreement with Wilson et al. (1996). The resulting shear
pattern, obtained from 2147 galaxy images, clearly shows a circular pattern around
the cluster centre as defined by the centroid of the optical and X-ray light. Using
the Kaiser & Squires reconstruction method (5.7), Fahlman et al. produced maps
of the dimension-less surface mass density κ(θ), both by taking all galaxy images
into account, and after splitting the galaxy sample into a 'brighter' and 'fainter'
sample of roughly equal size. Although differing in detail, the resulting mass maps show
an overall similarity. In particular, the position of the mass centre is very similar in
all maps.

Fahlman et al. applied the aperture mass method to determine the cluster mass - see
(5.20) and (5.28) - in an annulus centred on the cluster centre with inner radius
ϑ₁ = 2.76′ and an outer radius such that the annulus nearly fits into their data field.
The lower limit to the mean surface mass density in the annulus is
κ̄(2.76′) ≥ ζ = 0.06 ± 0.013. To convert this into an estimate of the physical surface
mass density and the total mass inside the aperture, the mean distance ratio D_ds/D_s
for the galaxy population has to be estimated, or equivalently the mean value of w as
defined in (5.34).

While the redshift distribution is known statistically for the brighter sub-sample
from redshift surveys, the use of the fainter galaxies requires an extrapolation of the
galaxy redshifts. From that, Fahlman et al. estimated the mass within a cylinder of
radius ϑ₁ = 2.76′, corresponding to 0.48 h⁻¹ Mpc for an Einstein-de Sitter cosmology,
to be ∼ 3.5 × 10¹⁴ h⁻¹ M⊙. This corresponds to a mass-to-light ratio (in solar
units) of M/L ∼ 800 h. Carlberg et al. (1994) obtained 75 redshifts of galaxies in
the cluster field, of which 30 are cluster members. From their line-of-sight velocity
dispersion, the cluster mass can be estimated by a virial analysis. The resulting
mass is lower by a factor ∼ 3 than the weak-lensing estimate. The mass-to-light
ratio from the virial analysis is much closer to typical values in lower-redshift clusters
like Coma, which has M/L ≈ 270 h⁻¹. The high mass estimate of this cluster
was recently confirmed in a completely independent study by Fischer (1999).

The origin of this large apparent discrepancy is not well understood yet, and sev- 
eral possibilities are discussed in Kaiser et al. (1994). It should be pointed out that 
lensing measures the total mass inside a cone, weighted by the redshift-dependent 
factor Dd^ds/^s, and hence the lensing mass estimate possibly includes substan- 
tial foreground and background material. While this may cause an overestimate 
of the mass, it is quite unlikely to cause an overestimate of the mass-to-light ra- 
tio of the total material inside the cone. Foreground material will contribute much 
more strongly to the light than to the measured mass, and additional matter be- 
hind the cluster will not be very efficient as a lens. The uncertainty in the redshift 
distribution of the faint galaxies translates into an uncertainty in the mass. How- 
ever, all background galaxies would have to be put at a redshift ~ 4 to explain 
the mass discrepancy, while redshift surveys show that the brighter sub-sample of 
Fahlman et al. has a mean redshift below unity. The mass estimate is only weakly 
dependent on the assumed cosmological model. On the other hand, the light dis- 
tribution of the cluster MS 1224 is not circular, and it cannot be excluded that this 
cluster is not in virial equilibrium.

Squires et al. (1996a) compared the mass profiles derived from weak lensing data and the
X-ray emission of the cluster A 2218. Under the assumption that the hot X-ray-emitting
intra-cluster gas is in hydrostatic equilibrium between gravity and thermal pressure
support, the mass profile of the cluster can be constrained. The reconstructed mass
map qualitatively agrees with the optical and X-ray light distributions. Using the 
aperture mass estimate, a mass-to-light ratio of M/L = (440 ± 80) h in solar units
is found. The radial mass profile appears to be flatter than isothermal. Within the 
error bars, it agrees with the mass profile obtained from the X-ray analysis, with a 
slight indication that at large radii the lensing mass is larger than the mass inferred 
from X-rays. 

Abell 2218 also contains a large number of arcs and multiply-imaged galaxies 
which have been used by Kneib et al. (1996) to construct a detailed mass model 
of the cluster's central region. In addition to the main mass concentration, there is 
a secondary clump of cluster galaxies whose effect on the arcs is clearly visible.
The separation of these two mass centres is 61". Whereas the resolution of the weak
lensing mass map as obtained by Squires et al. is not sufficient to reveal a distinct
secondary peak, the elongation of the central density contours extends towards the
secondary galaxy clump.

General agreement between the reconstructed mass map and the distribution of 
cluster galaxies and X-ray emission has also been found for the two clusters 
Cl 1455+22 (z = 0.26) and Cl 0016+16 (z = 0.55) by Smail et al. (1995a). Both
are highly X-ray luminous clusters in the Einstein Extended Medium Sensitivity 
Survey (EMSS; Stocke et al. 1991). The orientation and ellipticity of the central 
mass peak is in striking agreement with those of the galaxy distribution and the X- 
ray map. However, the authors find some indication that the mass is more centrally 
condensed than the other two distributions. In addition, given the finite angular res- 
olution of the mass map, the core size derived from weak lensing is most likely 
only an upper bound to the true value, and in both clusters the derived core size 
is significantly larger than found in clusters with giant luminous arcs (see, e.g.,
Fort & Mellier 1994).

The mass-to-light ratios for the two clusters are ~ 1000 h and ~ 740 h, respectively.
However, at least for Cl 0016+16, the mass scale is fairly uncertain, owing to the
high cluster redshift and the unknown redshift distribution of the faint galaxies. The
mean value of $D_{ds}/D_s$ must be estimated from an assumed distribution p(z).

The unprecedented imaging quality of the refurbished Hubble Space Telescope 
(HST) can be used profitably for weak lensing analyses. Images taken with the 
Wide Field Planetary Camera 2 (WFPC2) have an angular resolution of order 0."1,
limited by the pixel size. Because of this superb resolution and the lower sky back- 
ground, the number density of galaxy images for which a shape can reliably be mea- 
sured is considerably larger than from the ground, so that higher-resolution mass 
maps can be determined. The drawback is the small field covered by the WFPC2,
which consists of 3 CCD chips with 80" side-length each. Using the first publicly
available deep image of a cluster obtained with the WFPC2, Seitz et al. (1996)
have constructed a mass map of the cluster Cl 0939+47 (z = 0.41). Figure 14
clearly shows a mass peak near the left boundary of the frame shown. This maximum
coincides with the cluster centre as determined from the cluster galaxies
(Dressler & Gunn 1992). Furthermore, a secondary maximum is clearly visible in
the mass map, as well as a pronounced minimum. When compared to the optical 
image, a clear correlation with the bright (cluster) galaxies is obvious. In particular, 
the secondary maximum and the minimum correspond to the same features in the 
bright galaxy distribution. A formal correlation test confirms this similarity. Apply- 
ing the maximum-likelihood mass reconstruction technique (Seitz et al. 1998c; see 
Sect. 5.4) to the same HST image, Geiger & Schneider (1999) constructed a higher- 
resolution map of this cluster. The angular resolution achieved is much higher in 
the cluster centre, predicting a region in which strong lensing effects may occur. 
Indeed, Trager et al. (1997) reported on a highly elongated arc and a triple image, 
with both source galaxies having a redshift z ≈ 3.97.

Fig. 14. Left panel: WFPC2 image of the cluster Cl 0939+4713 (A 851); North is at the bottom,
East to the right. The cluster centre is located at about the upper left corner of the left
CCD, a secondary maximum of the bright (cluster) galaxies is seen close to the interface of
the two lower CCDs, and a minimum in the cluster light is at the interface between the two
right CCDs. In the lensing analysis, the data from the small CCD (the Planetary Camera)
were not used. Right panel: The reconstructed mass distribution of A 851, assuming a mean
redshift of the N = 295 galaxies with 24 < R < 25.5 of $\langle z \rangle$ = 1.

The X-ray map of this cluster (Schindler & Wambsganss 1997) shows that the two 
mass peaks are also close to two X-ray components. The determination of the total 
mass inside the WFPC2 frame is difficult, for two reasons: First, the high redshift 
of the cluster implies that the mean value of $D_{ds}/D_s$ depends quite sensitively on
the assumed redshift distribution of the background galaxies. Second, the small 
field of the WFPC2 precludes the measurement of the surface mass density at large 
distance where $\kappa$ tends to zero, and thus the mass-sheet degeneracy implies a considerable
uncertainty in the mass scale. Attempting to lift the mass-sheet degeneracy
with the number-density effect (see Sect. 4.4.1), a mass-to-light ratio of ~ 250 h was
derived within the WFPC2 aperture. This value is also affected by the unknown 
fraction of cluster members in the catalog of faint galaxies. Seitz et al. (1996) as- 
sumed that the spatial distribution of faint cluster galaxies follows that of brighter 
cluster galaxies. The striking difference between the M/L ratios for this and the 
other clusters described above may be related to the fact that Cl 0939+47 is the
highest-redshift cluster in the Abell catalog (A 851). Hence, it was selected by its
high optical luminosity, whereas the previously mentioned clusters are all X-ray
selected. The X-ray luminosity of Cl 0939+47 is fairly small for such a rich cluster
(Schindler & Wambsganss 1996). Since X-ray luminosity and cluster mass are
generally well correlated, the small M/L-ratio found from the weak lensing analy- 
sis is in agreement with the expectations based on the high optical flux and the low 
X-ray flux. Note that the large spread of mass-to-light ratios as found by the exist- 
ing cluster mass reconstructions is unexpected in the frame of hierarchical models 
of structure formation and thus poses an interesting astrophysical problem.

Hoekstra et al. (1998) reconstructed the mass distribution in the cluster MS 1358+62 from a mosaic
of HST images, so that their data field is substantially larger than for a single
HST pointing (about 8' x 8'). This work uses the correction method presented in
Sect. 4.6.2, thus accounting for the relatively strong PSF anisotropy at the edges of
each WFPC2 chip. A weak-lensing signal out to 1.5 Mpc is found. The X-ray mass
is found to be slightly lower than the dynamical mass estimate, but seems to agree 
well with the lensing mass determination.

Luppino & Kaiser (1997) found a surprisingly strong weak-lensing signal in the field of the high-redshift
cluster MS 1054-03 (z = 0.83). This implies that the sheared galaxies must have
an appreciably higher redshift than the cluster, thus strongly constraining their redshift
distribution. In fact, unless the characteristic redshift of these faint background
galaxies is > 1.5, this cluster would have an unrealistically large mass. It was
also found that the lensing signal from the bluer galaxies is stronger than from 
the redder ones, indicating that the characteristic redshift of the bluer sample is 
higher. In fact, the mass estimated assuming $\langle z_s \rangle$ = 1.5 agrees well with results
from analyses of the X-ray emission (Donahue et al. 1998) and galaxy kinematics
(Tran et al. 1999). Clowe et al. (1998) derived weak lensing maps for two additional
clusters at z ~ 0.8, namely MS 1137+66 at z = 0.783 and RXJ 1716+67 at
z = 0.813. 

The mass distribution in the supercluster MS 0302+17 at z = 0.42 was recon- 
structed by Kaiser et al. (1998) in a wide-field image of size ~ 30'. The supercluster 
consists of three clusters which are very close together on the sky and in redshift. 
The image contains about 30,000 galaxies from which a shear can be measured. 
This shear was found to correlate strongly with the distribution of the early-type
(foreground) galaxies in the field, provided that the overall mass-to-light ratio is
about 250h. Each of the three clusters, which are also seen in X-rays, is recov- 
ered in the mass map. The ratios between mass and light or X-ray emission differ 
slightly across the three clusters, but the differences are not highly significant. 

A magnification effect was detected from the depletion of the number counts (see 
Sect. 4.4.1) in two clusters. Fort et al. (1997) discovered that the number density 
of very faint galaxies drops dramatically near the critical curve of the cluster 
Cl 0024+16, and remains considerably lower than the mean number density out
to about twice the Einstein radius. This is seen in photometric data with two filters. 
Fort et al. (1997) interpret this broad depletion curve in terms of a broad redshift 
distribution of the background galaxies, so that the location of the critical curve of 
the cluster varies over a large angular scale. A spatially-dependent number deple- 
tion was detected in the cluster A 1689 by Taylor et al. (1998). 

These examples should suffice to illustrate the current status of weak lensing 
cluster mass reconstructions. For additional results, see Squires et al. (1996b), 
Squires et al. (1997), Fischer et al. (1997), Fischer & Tyson (1997). Many of the 
difficulties have been overcome; e.g., the method presented in Sect. 4.6.2 appears 
to provide an accurate correction method for PSF effects. The quantitative results, 
for example for the M/L-ratios, are somewhat uncertain due to the lack of suffi- 
cient knowledge on the source redshift distribution, which applies in particular to 
the high-redshift clusters. 

Further large-format HST mosaic images either are already or will soon become 
available, e.g. for the clusters A 2218, A 1689, and MS 1054-03. Their analysis 
will substantially increase the accuracy of cluster mass determinations from weak 
lensing compared to ground-based imaging. 

5.5 Outlook 

We have seen in the preceding subsection that first results on the mass distribution 
in clusters were derived with the methods described earlier. Because weak lensing 
is now widely regarded as the most reliable method to determine the mass distri- 
bution of clusters, since it does not rely on assumptions on the physical state and 
symmetries of the matter distribution, further attempts at improving the method are 
in progress, and some of them will briefly be outlined below. 

In particular, we describe a method which simultaneously accounts for shear and 
magnification information, and which can incorporate constraints from strong-lensing
features (such as arcs and multiple images of background sources). A
method for the determination of the local shear is described next which does not
rely on the detection and the quadrupole measurement of individual galaxies, and
instead makes use of the light from very faint galaxies which need not be individually
detected. We will finally consider the potential of weak lensing for determining
the redshift distribution of galaxies which are too faint to be investigated spectroscopically,
and report on first results.

5.5.1 Maximum-Likelihood Cluster Reconstructions

The mass reconstruction method described above is a direct method: The locally 
averaged observed image ellipticities $\langle\epsilon\rangle$ are inserted into an inversion equation
such as (5.9) to find the mass map $\kappa(\vec\theta)$. The beauty of this method is its simplicity
and computational speed. Mass reconstructions from the observed image elliptici- 
ties are performed in a few CPU seconds. 

The drawback of this method is its lack of flexibility. No additional information can 
be incorporated into the inversion process. For example, if strong-lensing features
like giant arcs or multiple galaxy images are observed, they should be included in
the mass reconstruction. Since such strong-lensing features typically occur in the
innermost parts of the clusters (at < 30" from cluster centres), they strongly constrain
the mass distribution in cluster cores which can hardly be probed by weak
lensing alone due to its finite angular resolution. A further example is the incorpora- 
tion of magnification information, as described in Sect. 4.4, which can in principle 
not only be used to lift the mass-sheet degeneracy, but also provides local informa- 
tion on the shape of the mass distribution. 

An additional problem of direct inversion techniques is the choice of the smoothing 
scale which enters the weight factors $u_i$ in (5.15). We have not given a guideline on
how this scale should be chosen. Ideally, it should be adapted to the data. In regions 
of strong shear, the signal-to-noise ratio of a shear measurement for a fixed number 
of galaxy images is larger than in regions of weak shear, and so the smoothing scale 
can be smaller there. 

Recently, these problems have been attacked with inverse methods. Suppose the 
mass distribution of a cluster is parameterised by a set of model parameters $p_k$.
These model parameters could then be varied until the best-fitting model for the
observables is found. Considering for example the observed image ellipticities $\epsilon_i$
and assuming a non-critical cluster, the expectation value of $\epsilon_i$ is the reduced shear
g at the image position, and the dispersion is determined (mainly) by the intrinsic
dispersion of galaxy ellipticities $\sigma_\epsilon$. Hence, one can define a $\chi^2$-function

$$\chi^2 = \sum_{i=1}^{N_g} \frac{\left|\epsilon_i - g(\vec\theta_i)\right|^2}{\sigma_\epsilon^2} \;, \qquad (5.33)$$

and minimise it with respect to the $p_k$. A satisfactory model is obtained if $\chi^2$ is of
order $N_g$ at its minimum, as long as the number of parameters is much smaller than
$N_g$. If the chosen parameterisation does not achieve this minimum value, another
one must be tried. However, the resulting mass model will depend on the parameterisation,
which is a serious drawback relative to the parameter-free inversion methods
discussed before.
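
As a minimal illustration of such a parameterised fit, the sketch below minimises a chi-square of the form (5.33) for a one-parameter model. Everything specific in it is an assumption made for illustration and is not taken from the works cited here: the lens is a singular isothermal sphere with Einstein radius `theta_e`, tangentially aligned images are given the sign convention gamma = -|gamma| exp(2i phi), the mock ellipticities are Gaussian, and the minimisation is a plain scan over trial radii.

```python
import cmath
import math
import random


def sis_reduced_shear(x, y, theta_e):
    """Reduced shear g = gamma / (1 - kappa) of a singular isothermal sphere.

    Assumed toy model: kappa = |gamma| = theta_e / (2 r), tangential alignment.
    """
    r = math.hypot(x, y)
    phi = math.atan2(y, x)
    kappa = theta_e / (2.0 * r)
    gamma = -kappa * cmath.exp(2j * phi)
    return gamma / (1.0 - kappa)


def chi_square(catalogue, theta_e, sigma_eps):
    """Sum of |eps_i - g(theta_i)|^2 / sigma_eps^2 over (x, y, eps) entries."""
    total = 0.0
    for x, y, eps in catalogue:
        total += abs(eps - sis_reduced_shear(x, y, theta_e)) ** 2
    return total / sigma_eps ** 2


def fit_einstein_radius(catalogue, sigma_eps, trial_radii):
    """Scan the single model parameter; return (best radius, minimum chi^2)."""
    best = min(trial_radii, key=lambda t: chi_square(catalogue, t, sigma_eps))
    return best, chi_square(catalogue, best, sigma_eps)


def make_mock_catalogue(n_gal, theta_e, sigma_eps, rng):
    """Mock galaxies in the non-critical region 2 theta_e < r < 10 theta_e."""
    # sigma_eps is the dispersion of the complex ellipticity, so each
    # component has dispersion sigma_eps / sqrt(2).
    s = sigma_eps / math.sqrt(2.0)
    catalogue = []
    for _ in range(n_gal):
        r = theta_e * math.sqrt(rng.uniform(4.0, 100.0))  # uniform in area
        phi = rng.uniform(0.0, 2.0 * math.pi)
        x, y = r * math.cos(phi), r * math.sin(phi)
        eps_s = complex(rng.gauss(0.0, s), rng.gauss(0.0, s))
        g = sis_reduced_shear(x, y, theta_e)
        # Source-to-image ellipticity mapping, valid for |g| < 1.
        eps = (eps_s + g) / (1.0 + g.conjugate() * eps_s)
        catalogue.append((x, y, eps))
    return catalogue


if __name__ == "__main__":
    mock = make_mock_catalogue(2000, 20.0, 0.3, random.Random(1))
    radii = [5.0 + 0.25 * k for k in range(121)]
    radius, chi2_min = fit_einstein_radius(mock, 0.3, radii)
    print("best-fit radius:", radius, " chi^2 / N_g:", chi2_min / len(mock))
```

With a single parameter and a few thousand galaxies, the minimum chi-square per galaxy should come out close to unity, which is the acceptance criterion stated above. A model with more parameters would replace the scan by a proper minimiser.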

This problem can be avoided with 'generic' mass models. For instance, the
deflection potential $\psi(\vec\theta)$ can be composed of a finite sum of Fourier modes
(Squires & Kaiser 1996), whose amplitudes are the parameters $p_k$.[*] The number
of Fourier modes can be chosen such that the resulting $\chi^2$ per degree of freedom is
approximately unity. Additional modes would then start to fit the noise in the data.

Alternatively, the values of the deflection potential $\psi$ on a (regular) grid can be
used as the $p_k$. Bartelmann et al. (1996) employed the locally averaged image ellipticities
and the size ratios $\langle\omega\rangle / \langle\omega\rangle_0$ - see (4.47) - on a grid. The corresponding
expectation values of these quantities, the reduced shear g and the magnification
$\mu$, were calculated by finite differencing of the discretised deflection potential
$\psi$. Since both $\gamma$ and $\kappa$, and thus $\mu$, are unchanged under the transformation
$\psi(\vec\theta) \to \psi(\vec\theta) + \psi_0 + \vec a \cdot \vec\theta$, the deflection potential has to be kept fixed at three grid
points. If no magnification information is used, the mass-sheet degeneracy allows a
further transformation of $\psi$ which leaves the expected image ellipticities invariant,
and the potential has to be kept fixed at four grid points.
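
The finite-differencing step and the invariance under adding a constant and a linear term to the potential can be checked with a short sketch. It assumes the standard relations kappa = (psi_11 + psi_22)/2, gamma_1 = (psi_11 - psi_22)/2 and gamma_2 = psi_12, evaluated with second-order central differences on interior grid points. The function names and the quadratic test potential are hypothetical and are not the implementation of Bartelmann et al. (1996).

```python
def kappa_gamma_from_potential(psi, h):
    """Convergence and complex shear from a gridded deflection potential.

    psi[i][j] is the potential at (theta_1, theta_2) = (i*h, j*h).
    Returns nested lists for the interior grid points only.
    """
    n1, n2 = len(psi), len(psi[0])
    kappa, gamma = [], []
    for i in range(1, n1 - 1):
        k_row, g_row = [], []
        for j in range(1, n2 - 1):
            psi_11 = (psi[i + 1][j] - 2.0 * psi[i][j] + psi[i - 1][j]) / h ** 2
            psi_22 = (psi[i][j + 1] - 2.0 * psi[i][j] + psi[i][j - 1]) / h ** 2
            psi_12 = (psi[i + 1][j + 1] - psi[i + 1][j - 1]
                      - psi[i - 1][j + 1] + psi[i - 1][j - 1]) / (4.0 * h ** 2)
            k_row.append(0.5 * (psi_11 + psi_22))
            g_row.append(complex(0.5 * (psi_11 - psi_22), psi_12))
        kappa.append(k_row)
        gamma.append(g_row)
    return kappa, gamma


def reduced_shear_and_magnification(kappa, gamma):
    """g = gamma / (1 - kappa) and mu = 1 / ((1 - kappa)^2 - |gamma|^2)."""
    return gamma / (1.0 - kappa), 1.0 / ((1.0 - kappa) ** 2 - abs(gamma) ** 2)


def quadratic_potential(n, h, kappa0, gamma0, psi0=0.0, a=(0.0, 0.0)):
    """Hypothetical test potential with constant kappa0 and gamma0,
    plus the additive terms psi0 + a . theta that should not matter."""
    grid = []
    for i in range(n):
        t1 = i * h
        row = []
        for j in range(n):
            t2 = j * h
            row.append(0.5 * kappa0 * (t1 * t1 + t2 * t2)
                       + 0.5 * gamma0.real * (t1 * t1 - t2 * t2)
                       + gamma0.imag * t1 * t2
                       + psi0 + a[0] * t1 + a[1] * t2)
        grid.append(row)
    return grid


if __name__ == "__main__":
    plain = quadratic_potential(6, 0.5, 0.3, 0.1 + 0.05j)
    shifted = quadratic_potential(6, 0.5, 0.3, 0.1 + 0.05j, psi0=7.0, a=(0.4, -1.3))
    print(kappa_gamma_from_potential(plain, 0.5)[0][0])
    print(kappa_gamma_from_potential(shifted, 0.5)[0][0])
```

Both calls return the same convergence and shear, which illustrates why three grid values of the potential carry no information and must be held fixed during a minimisation.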

A $\chi^2$-function was defined using the local dispersion of the image ellipticities and
image sizes relative to unlensed sizes of galaxies with the same surface brightness,
and it was minimised with respect to the values of $\psi$ on the grid points. The grid
spacing was chosen such that the resulting minimum $\chi^2$ has approximately the correct
value. Tests with synthetic data sets, using a numerically generated cluster mass
distribution, showed that this method reconstructs very satisfactory mass maps, and 
the total mass of the cluster was accurately reproduced. 

If a finer grid is used, the model for the deflection potential will reproduce noise 
features in the data. On the other hand, the choice of a relatively coarse grid which 
yields a satisfactory $\chi^2$ implies that the resolution of the mass map is constant
over the data field. Given that the signal increases towards the centre of the clus- 
ter, one would like to use a finer grid there. To avoid over-fitting of noise, the 
maximum-likelihood method can be complemented by a regularisation term (see 
Press et al. 1986, Chap. 18). As shown by Seitz et al. (1998c), a maximum-entropy 
regularisation (Narayan & Nityananda 1986) is well suited for the problem at hand. 
As in maximum-entropy image restoration (e.g., Lucy 1994), a prior is used in the
entropy term which is a smoothed version of the current density field, and thus is
being adapted during the minimisation. The relative weight of the entropy term is
adjusted such that the resulting minimum $\chi^2$ is of order unity per degree of freedom.

[*] It is important to note that the deflection potential $\psi$ rather than the surface mass density
$\kappa$ (as in Squires & Kaiser (1996)) should be parameterised, because shear and surface mass
density depend on the local behaviour of $\psi$, while the shear cannot be obtained from the
local $\kappa$, and not even from $\kappa$ on a finite field. In addition, the local dependence of $\kappa$ and $\gamma$
on $\psi$ is computationally much more efficient than calculating $\gamma$ by integrating over $\kappa$ as in
Bridle et al. (1998).

In this scheme, the expectation values and dispersions of the individual image 
ellipticities and sizes are found by bi-linear interpolation of $\kappa$ and $\gamma$ on the grid
which themselves are obtained by finite differencing of the potential. When tested 
on synthetic data sets, this refined maximum-likelihood method produces mass 
maps with considerably higher resolution near the cluster centre without over- 
fitting the noise at larger cluster-centric distances. The practical implementation 
of this method is somewhat complicated. In particular, if critical clusters are stud- 
ied, some modifications have to be included to allow the minimisation algorithm to 
move critical curves across galaxy images in the lens plane. However, the quality 
of the reconstruction justifies the additional effort, especially if high-quality data 
from HST images are available. A first application of this method is presented by 
Geiger & Schneider (1999). 

Inverse methods such as the ones described here are likely to become the stan- 
dard tool for cluster mass profile reconstruction, owing to their flexibility. As men- 
tioned before, additional constraints from strong lensing signatures such as arcs and 
multiply-imaged sources, can straightforwardly be incorporated into these meth- 
ods. The additional numerical effort is negligible compared to the efforts needed to 
gain the observational data. Direct inversion methods will certainly retain an important
role in this field, to obtain quick mass maps during the galaxy image-selection
process (e.g., cuts in colour and brightness can be applied). Also, using a mass map
obtained by a direct method as a starting model in the inverse methods reduces the
computational effort.



5.5.2 The Auto-Correlation Function of the Extragalactic Background Light 

So far, we described how shear can be determined from ellipticities of individual
galaxy images on a CCD. In that context, a galaxy image is a statistically significant
flux enhancement on the CCD covering several contiguous pixels and being more 
extended than the PSF as determined from stars. Reducing the threshold for the 
signal-to-noise per object, the number density of detected galaxies increases, but 
so does the fraction of misidentifications. Furthermore, the measured ellipticity of
faint galaxies has larger errors than that of brighter and larger images. The detection 
threshold therefore is a compromise between high number density of images and 
significance per individual object. 

Even the faintest galaxy images whose ellipticity cannot be measured reliably still
contain information on the lens distortion. It is therefore plausible to use this information
by 'adding up' the faintest galaxies statistically. For instance, one could
co-add their brightness profiles and measure the shear of the combined profile.
This procedure, however, is affected by the uncertainties in defining the centres of
the faint galaxies. Any error in the position of the centre, as defined in (5.1), will
affect the resulting ellipticity.



To avoid this difficulty, and also the problem of faint object definition altogether,
Van Waerbeke et al. (1997) have suggested considering the auto-correlation function
(ACF) of the 'background' light. Most of the sky brightness is due to atmospheric
scattering, but this contribution is uniform. Fluctuations of the brightness on
the scale of arc seconds are supposedly mainly due to very faint galaxies. Therefore,
these fluctuations should intrinsically be isotropic. If the light from the faint galax- 
ies propagates through a tidal gravitational field, the isotropy will be perturbed, and 
this provides a possibility to measure this tidal field. 

Specifically, if $I(\vec\theta)$ denotes the brightness distribution as measured on a CCD, and
$\bar I$ is the brightness averaged over the CCD (or a part of it, see below), the auto-correlation
function $\xi(\vec\theta)$ of the brightness is defined as

$$\xi(\vec\theta) = \left\langle \left(I(\vec\vartheta) - \bar I\right) \left(I(\vec\vartheta + \vec\theta) - \bar I\right) \right\rangle_{\vec\vartheta} \;, \qquad (5.34)$$

where the average is performed over all pairs of pixels with separation $\vec\theta$. From
the invariance of surface brightness (3.10, page 49) and the locally linearised lens
mapping, $I(\vec\theta) = I^{(s)}(\mathcal{A}\vec\theta)$, one finds that the observed ACF is related to the intrinsic
ACF $\xi^{(s)}$, defined in complete analogy to (5.34), by

$$\xi(\vec\theta) = \xi^{(s)}(\mathcal{A}\vec\theta) \;. \qquad (5.35)$$

Thus the transformation from intrinsic to observed ACF has the same functional
form as the transformation of surface brightness. In analogy to the definition of the
quadrupole tensor Q for galaxy images - see (5.2) - the tensor of second moments
of the ACF is defined as

$$\mathcal{M}_{ij} = \frac{\int d^2\theta \; \xi(\vec\theta)\, \theta_i \theta_j}{\int d^2\theta \; \xi(\vec\theta)} \;. \qquad (5.36)$$

The transformation between the observed quadrupole tensor $\mathcal{M}$ and the intrinsic
one, $\mathcal{M}^{(s)}$, is the same as for the moment tensor of image ellipticities, (5.5),
$\mathcal{M}^{(s)} = \mathcal{A} \mathcal{M} \mathcal{A}$. As shown by Van Waerbeke et al. (1997), the tensor $\mathcal{M}$ directly determines
the distortion $\delta$,

$$\delta = \frac{\mathcal{M}_{11} - \mathcal{M}_{22} + 2i\,\mathcal{M}_{12}}{\mathcal{M}_{11} + \mathcal{M}_{22}} \;. \qquad (5.37)$$

Hence, $\delta$ is related to $\mathcal{M}$ in the same way as the complex ellipticity $\chi$ is related to
Q. In some sense, the ACF plays the role of a single 'equivalent' image from which
the distortion can be determined, instead of an ensemble average over individual
galaxy ellipticities.

Working with the ACF has several advantages. First, centres of galaxy images do
not need to be determined, which avoids a potential source of error. Second, the 
ACF can be used with substantial flexibility. For instance, one can use all galaxy 
images which are detected with high significance, determine their ellipticity, and 
obtain an estimate of $\delta$ from them. Sufficiently large circles containing these galaxies
can be cut out of the data frame, so that the remaining frame is reminiscent of
a Swiss cheese. The ACF on this frame provides another estimate of $\delta$, which is
independent information and can statistically be combined with the estimate from 
galaxy ellipticities. Or one can use the ACF only on galaxy images detected within 
a certain magnitude range, still avoiding the need to determine centres. 
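
The ACF estimate of the distortion described in this subsection can be sketched in a few lines. The code below is a hedged illustration, not the procedure of Van Waerbeke et al. (1997): it computes the ACF by direct summation with periodic wrapping of the image, subtracts a baseline measured in an outer annulus of lags so that the moment integrals converge, and truncates the moments at a lag radius `r_max`. The wrapping, the baseline subtraction, the truncation and all function names are assumptions made here.

```python
import math


def brightness_acf(image, max_lag):
    """ACF of the mean-subtracted image for lags |dx|, |dy| <= max_lag.

    image is a list of rows; periodic wrapping at the edges is assumed.
    Returns a dict mapping (dx, dy) to the pixel-pair average.
    """
    ny, nx = len(image), len(image[0])
    mean = sum(map(sum, image)) / (nx * ny)
    fluct = [[value - mean for value in row] for row in image]
    acf = {}
    for dy in range(-max_lag, max_lag + 1):
        for dx in range(-max_lag, max_lag + 1):
            k = dx % nx
            total = 0.0
            for yy in range(ny):
                other = fluct[(yy + dy) % ny]
                rolled = other[k:] + other[:k]  # rolled[x] = other[(x + dx) % nx]
                total += sum(a * b for a, b in zip(fluct[yy], rolled))
            acf[(dx, dy)] = total / (nx * ny)
    return acf


def acf_distortion(acf, r_max, r_base):
    """Distortion from the second moments of the ACF.

    Moments use lags with r <= r_max; the baseline is the mean ACF value
    in the annulus r_max < r <= r_base (an assumption of this sketch).
    """
    ring = [v for (dx, dy), v in acf.items()
            if r_max < math.hypot(dx, dy) <= r_base]
    baseline = sum(ring) / len(ring)
    norm = m11 = m22 = m12 = 0.0
    for (dx, dy), value in acf.items():
        if math.hypot(dx, dy) <= r_max:
            w = value - baseline
            norm += w
            m11 += w * dx * dx
            m22 += w * dy * dy
            m12 += w * dx * dy
    m11, m22, m12 = m11 / norm, m22 / norm, m12 / norm
    delta = complex(m11 - m22, 2.0 * m12) / (m11 + m22)
    return delta, (m11, m12, m22)


def gaussian_blob(n, c11, c12, c22):
    """Hypothetical test image: one Gaussian blob with covariance C."""
    det = c11 * c22 - c12 * c12
    centre = n // 2
    image = []
    for yy in range(n):
        y = yy - centre
        row = []
        for xx in range(n):
            x = xx - centre
            q = (c22 * x * x - 2.0 * c12 * x * y + c11 * y * y) / det
            row.append(math.exp(-0.5 * q))
        image.append(row)
    return image


if __name__ == "__main__":
    acf = brightness_acf(gaussian_blob(32, 2.25, 0.5, 1.0), 13)
    delta, moments = acf_distortion(acf, 10.0, 13.0)
    print(delta, moments)
```

For the single test blob the ACF is a Gaussian with twice the covariance of the blob, so the recovered distortion should match (C_11 - C_22 + 2i C_12)/(C_11 + C_22) without any centre ever being determined. For real frames an FFT-based correlation would be the natural replacement for the direct sum.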

Third, on sufficiently deep images with the brighter objects cut out as just described, 
one might assume that the intrinsic ACF is due to a very large number of faint 
galaxies, so that the intrinsic ACF becomes a universal function. This function can 
in principle be determined from deep HST images. In that case, one also knows the 
width of the intrinsic ACF, as measured by the trace or determinant of $\mathcal{M}^{(s)}$, and can
determine the magnification from the width of the observed ACF, very similar to 
the method discussed in Sect. 4.4.2, but with the advantage of dealing with a single 
'universal source'. 
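
The step from the width of the ACF to the magnification can be made explicit. As a sketch, assume the transformation of the ACF moment tensor stated earlier in this subsection, $\mathcal{M}^{(s)} = \mathcal{A}\mathcal{M}\mathcal{A}$, and a non-critical lens with $\det\mathcal{A} > 0$. Taking determinants then gives

```latex
\det\mathcal{M}^{(s)} = (\det\mathcal{A})^2\,\det\mathcal{M}
\quad\Longrightarrow\quad
\mu = \frac{1}{\det\mathcal{A}} = \sqrt{\frac{\det\mathcal{M}}{\det\mathcal{M}^{(s)}}} \;.
```

A known universal $\det\mathcal{M}^{(s)}$ would therefore turn the measured $\det\mathcal{M}$ into an estimate of $\mu$. This relation ignores PSF smearing and noise, which would have to be corrected for in practice.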

If this universal intrinsic ACF does exist, corrections of the measured $\mathcal{M}$ for a
PSF considerably simplify compared to the case of individual image ellipticities, 
as shown by Van Waerbeke et al. (1997). They performed several tests on synthetic 
data to demonstrate the potential of the ACF method for the recovery of the shear 
applied to the simulated images. Van Waerbeke et al. determined shear fields of two 
clusters, with several magnitude thresholds for the images which were punched out. 
A comparison of these shear fields with those obtained from the standard method 
using galaxy ellipticities clearly shows that the ACF method is at least competitive, 
but since it provides additional information from those parts of the CCD which 
are unused by the standard method, it should be employed in any case. The optimal
combination of standard method and ACF still needs to be investigated, but detailed 
numerical experiments indicate that the ACF may be the best method for measuring 
very weak shear amplitudes (L. van Waerbeke & Y. Mellier, private communica- 
tion). 



5.5.3 The Redshift Distribution of Very Faint Galaxies

Galaxy redshifts are usually determined spectroscopically. A successful redshift 
measurement depends on the magnitude of the galaxy, the exposure time, and the 
spectral type of the galaxy. If it shows strong emission or absorption lines, as star-forming
galaxies do, a redshift can be determined much more easily than in the absence of
strong spectral features. The recently completed Canadian-French Redshift Survey
(CFRS) selected 730 galaxies in the magnitude interval 17.5 < I < 22.5 (see
Lilly et al. 1995 and references therein). For 591 of them (81%), redshifts were
secured with multi-slit spectroscopy on a 3.6m telescope (CFHT) with a typical exposure
time of ~ 8 hours. Whereas the upcoming 10m-class telescopes will be able
to perform redshift surveys to somewhat fainter magnitude limits, it will be difficult
to secure fairly complete redshift information of a flux-limited galaxy sample
fainter than I ~ 24. In addition, it can be expected that many galaxies in a flux-limited
sample with fainter threshold will have redshifts between ~ 1.2 and ~ 2.2,
where the cleanest spectral features, the [O II] emission line at $\lambda$ = 372.7 nm and the
$\lambda$ = 400 nm break, are shifted beyond the region where spectroscopy can easily be
done from the ground. 

As we have seen, the calibration of cluster mass distributions depends on the as- 
sumed redshift distribution of the background galaxies. Most of the galaxies used 
for the reconstruction are considerably fainter than those magnitude limits for 
which complete redshift samples are available, so that this mass calibration re- 
quires an extrapolation of the redshift distribution from brighter galaxy samples. 
The fact that lensing is sensitive to the redshift distribution is not only a source of 
uncertainty, but also offers the opportunity to investigate the redshift distribution of 
galaxies too faint to be investigated spectroscopically. Several approaches towards 
a redshift estimate of faint galaxies by lensing have been suggested, and some of 
them have already shown spectacular success, as will be discussed next. 

First of all, a strongly lensed galaxy (e.g. a giant luminous arc) is highly magnified, 
and so the gravitational lens effect allows one to obtain spectra of objects which would
be too faint for a spectroscopic investigation without lensing. It was possible in this 
way to measure the redshifts of several arcs, e.g., the giant arc in A 370 at z = 0.724 
(Soucail et al. 1988), the arclet A 5 in A 370 at z = 1.305 (Mellier et al. 1991), the 
giant arc in Cl 2244-02 at z = 2.237 (Mellier et al. 1991), and the 'straight arc' in
A 2390 at z = 0.913. In the latter case, even the rotation curve of the source galaxy 
was determined (Pello et al. 1991). For a more complete list of arc redshifts, see 
Fort & Mellier (1994). If the cluster contains several strong-lensing features, the 
mass model can be sufficiently well constrained to determine the arc magnifications 
(if they are resolved in width, which has become possible only from imaging with 
the refurbished HST), and thus to determine the unlensed magnitude of the source 
galaxies, some of which are fainter than B ~ 25.

Some clusters, such as A 370 and A 2218, were observed in great detail both from 
the ground and with HST, and show a large number of strongly lensed images. 
They can be used to construct very detailed mass models of the cluster centre (e.g., 
Kneib et al. 1993; Kneib et al. 1996). An example is A 2218, in which at least five 
multiply imaged systems were detected (Kneib et al. 1996), and several giant arcs 
were clearly seen. Refining the mass model for A 2218 constructed from ground- 
based data (Kneib et al. 1995) with the newly discovered or confirmed strong lens- 
ing features on the WFPC2 image, a strongly constrained mass model for the clus- 
ter can be computed and calibrated by two arc redshifts (a five-image system at 
z = 0.702, and at z = 1.034).

Visual inspection of the WFPC2 image immediately shows a large number of arclets
in A 2218, which surround the cluster centre in a nearly perfect circular pattern.
These arclets have very small axis ratios, and most of them are therefore highly
distorted. The strength of the distortion depends on the redshift of the correspond- 
ing galaxy. Assuming that the sources have a considerably smaller ellipticity than 
the observed images, one can then estimate a redshift range of the galaxy. 

To be more specific, let $p^{(s)}(\epsilon^{(s)})$ be the probability density of the intrinsic source
ellipticity, assumed for simplicity to be independent of redshift. The corresponding
probability distribution for the image ellipticity is then

$$p(\epsilon) = p^{(s)}\!\left(\epsilon^{(s)}(\epsilon)\right) \det\left(\frac{\partial \epsilon^{(s)}}{\partial \epsilon}\right) \;, \qquad (5.38)$$

where the transformation $\epsilon^{(s)}(\epsilon)$ is given by eq. (4.12, page 61), and the final term
is the Jacobian of this transformation. For each arclet near the cluster centre where 
the mass profile is well constrained, the value of the reduced shear g is determined 
up to the unknown redshift of the source - see eq. (5.35). 

One can now try to maximise $p_\epsilon(\epsilon)$ with respect to the source
redshift, and in that way find the most likely redshift for the arc.^1 Depending
on the ellipticity of the arclet and the local values of shear and surface mass
density, three cases have to be distinguished: (1) The arclet has the 'wrong'
orientation relative to the local shear, i.e., if the source lies behind the
cluster, it must be even more elliptical than the observed arclet. For the
arclets in A 2218, this case is very rare. (2) The most probable redshift is 'at
infinity', i.e., even if the source is placed at very high redshift, the maximum
of $p_\epsilon(\epsilon)$ is not reached. (3) $p_\epsilon(\epsilon)$ attains a
maximum at a finite redshift. This is by far the most common case in A 2218.
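The redshift scan described above can be sketched numerically. The following Python fragment is only an illustration under stated assumptions, none of which are taken from the analyses cited here: a Gaussian intrinsic ellipticity distribution of width `sigma_eps`, Einstein-de Sitter distances, a reduced shear that scales with the distance ratio $w$ as $g = w\gamma_\infty/(1 - w\kappa_\infty)$, the transformation $\epsilon^{(s)} = (\epsilon - g)/(1 - g^*\epsilon)$ valid for $|g| < 1$, and all function and parameter names.

```python
import numpy as np

def dist_ratio(z_s, z_d):
    """D_ds/D_s divided by its value for a source at infinity (EdS assumed)."""
    z_s = np.asarray(z_s, dtype=float)
    ratio = (1.0 - np.sqrt((1.0 + z_d) / (1.0 + z_s))) / (1.0 - 1.0 / np.sqrt(1.0 + z_s))
    return np.where(z_s > z_d, ratio, 0.0)

def reduced_shear(z_s, z_d, kappa_inf, gamma_inf):
    """Assumed redshift scaling of the reduced shear, g = w*gamma / (1 - w*kappa)."""
    w = dist_ratio(z_s, z_d)
    return w * gamma_inf / (1.0 - w * kappa_inf)

def image_ellipticity_pdf(eps, g, sigma_eps=0.2):
    """p(eps) = p_s(eps_s(eps)) * det(d eps_s / d eps), valid for |g| < 1."""
    eps_s = (eps - g) / (1.0 - np.conj(g) * eps)
    # Jacobian determinant of the (holomorphic) map eps -> eps_s
    jac = ((1.0 - np.abs(g) ** 2) / np.abs(1.0 - np.conj(g) * eps) ** 2) ** 2
    # Hypothetical Gaussian model for the intrinsic ellipticity distribution
    p_src = np.exp(-np.abs(eps_s) ** 2 / sigma_eps**2) / (np.pi * sigma_eps**2)
    return p_src * jac

def most_likely_redshift(eps, z_d, kappa_inf, gamma_inf, z_max=5.0, n=4000):
    """Return (z_best, p_best, at_edge); at_edge flags case (2), 'at infinity'."""
    z_grid = np.linspace(z_d + 1.0e-3, z_max, n)
    g = reduced_shear(z_grid, z_d, kappa_inf, gamma_inf)
    p = image_ellipticity_pdf(eps, g)
    i = int(np.argmax(p))
    return z_grid[i], p[i], i == n - 1

# Arclet aligned with the local shear, lens at z_d = 0.175
z_best, p_best, at_edge = most_likely_redshift(0.2 + 0j, 0.175, 0.3, 0.3)
print(f"most likely source redshift: {z_best:.2f}, maximum at grid edge: {at_edge}")
```

With these toy numbers the maximum is reached at a finite redshift, corresponding to case (3); an arclet more elongated than the largest reduced shear attainable at that position pushes the maximum to the upper edge of the grid, corresponding to case (2).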

This method, first applied to A 370 (Kneib et al. 1994), was used to estimate the
redshifts of $\sim 80$ arclets in A 2218 brighter than $R \sim 25$. Their typical
redshifts are estimated to be of order unity, with the fainter sub-sample
$24 \lesssim R \lesssim 25$ extending to somewhat higher redshifts. For one of
them, a redshift range $2.6 \lesssim z \lesssim 3.3$ was estimated, and a
spectroscopic redshift of z = 2.515 was later measured (Ebbels et al. 1996),
providing spectacular support for this method. Additional spectroscopic
observations of arclets in A 2218 were conducted and further confirmed the
reliability of the method for the redshift estimates of individual arclets
(Ebbels et al. 1998).

Another success of this arclet redshift estimate was recently achieved in the
cluster A 2390, which can also be modelled in great detail from HST data. There,
two arclets with very strong elongation did not fit into the cluster mass model
unless they are at very high redshift. Spectroscopic redshifts of $z \sim 4.05$
were recently measured for these two arclets (Frye & Broadhurst 1998, Pello et
al. 1999).

^1 This simplified treatment neglects the magnification bias, i.e. the fact that at
locations of high magnification the redshift probability distribution is changed
- see Sect. 4.3.2.



However, several issues should be kept in mind. First, the arclets for which
a reliable estimate of the redshift can be obtained are clearly magnified, and
thus the sample is magnification biased. Since it is well known that the galaxy
number counts are considerably steeper in the blue than in the red (see, e.g.,
Smail et al. 1995a), blue galaxies are preferentially selected as arclets - see
eq. (4.42). This might also explain why most of the giant arcs are blue
(Broadhurst 1995). Therefore, the arclets probably represent a biased sample
of faint galaxies. Second, the redshift dependence of $p_\epsilon(\epsilon)$
enters through the ratio $D_\mathrm{ds}/D_\mathrm{s}$. For a cluster at relatively
low redshift, such as A 2218 ($z_\mathrm{d} = 0.175$), this ratio does not vary
strongly with redshift once the source redshift is larger than $\sim 1$. Hence,
to gain more accurate redshift estimates for high-redshift galaxies, a
moderately high-redshift cluster should be used.
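The weak redshift sensitivity of the distance ratio for a low-redshift lens can be checked with a few numbers. The sketch below assumes an Einstein-de Sitter universe purely for simplicity (the function name and the chosen redshifts are illustrative), in which $D_\mathrm{ds}/D_\mathrm{s} = [(1+z_\mathrm{d})^{-1/2} - (1+z_\mathrm{s})^{-1/2}] / [1 - (1+z_\mathrm{s})^{-1/2}]$.

```python
import math

def eds_distance_ratio(z_d: float, z_s: float) -> float:
    """D_ds / D_s in an Einstein-de Sitter universe (assumed for simplicity)."""
    if z_s <= z_d:
        return 0.0  # sources in front of the lens are not lensed
    x_d = 1.0 / math.sqrt(1.0 + z_d)
    x_s = 1.0 / math.sqrt(1.0 + z_s)
    return (x_d - x_s) / (1.0 - x_s)

# Compare a low-redshift lens (like A 2218) with a lens at z_d = 0.8
for z_d in (0.175, 0.8):
    ratios = [eds_distance_ratio(z_d, z_s) for z_s in (1.0, 2.0, 4.0)]
    print(z_d, [round(r, 3) for r in ratios])
```

For the lens at $z_\mathrm{d} = 0.175$ the ratio changes by less than 20 per cent between $z_\mathrm{s} = 1$ and $z_\mathrm{s} = 4$, whereas for a lens at $z_\mathrm{d} = 0.8$ it changes by a factor of about four, which is why a higher-redshift cluster discriminates better between high source redshifts.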

The method just described is not a real 'weak lensing' application, but lies on the
borderline between strong and weak lensing. With weak lensing, the redshifts of
individual galaxy images cannot be determined, but some statistical redshift
estimates can be obtained. Suppose the mass profile of a cluster has been
reconstructed using the methods described in Sect. 5.2 or Sect. 5.5.1, for which
galaxy images in a certain magnitude range were used. If the cluster contains
strong-lensing features with spectroscopic information (such as a giant luminous
arc with measured redshift), then the overall mass calibration can be determined,
i.e., the factor $\langle w\rangle$ - see Sect. 4.3.2 - can be estimated, which
provides a first integral constraint on the redshift distribution.

Repeating this analysis with several such clusters at different redshifts, further
estimates of $\langle w\rangle$ for different lens redshifts are obtained, and
thus additional constraints on the redshift distribution. In addition, one can
group the faint galaxy images into sub-samples, e.g., according to their apparent
magnitude. Ignoring for simplicity the magnification bias (which can safely be
done in the outer parts of clusters), one can determine $\langle w\rangle$ for
each magnitude bin. Restricting our treatment to the regions of weak lensing
only, such that $|\gamma| \ll 1$, $\kappa \ll 1$, the expectation value of the
ellipticity $\epsilon_i$ of a galaxy at position $\vec\theta_i$ is
$\langle w\rangle\,\gamma(\vec\theta_i)$, and so an estimate of $\langle w\rangle$
for the galaxy sub-sample under consideration is

$$ \langle w\rangle = \frac{\sum_i \Re\left[\gamma(\vec\theta_i)\,\epsilon_i^*\right]}{\sum_i |\gamma(\vec\theta_i)|^2} . \tag{5.39} $$

In complete analogy, Bartelmann & Narayan (1995) suggested the 'lens parallax
method', an algorithm for determining mean redshifts for galaxy sub-samples at
fixed surface brightness, using the magnification effect as described in
Sect. 4.4.2. Since the surface brightness $I$ is most likely much more strongly
correlated with galaxy redshift than the apparent magnitude (due to the
$(1+z)^{-4}$ decrease of bolometric surface brightness with redshift), a narrow
bin in $I$ will probably correspond to a fairly narrow distribution in redshift,
allowing one to relate $\langle w\rangle$ of a surface-brightness bin fairly
directly to a mean redshift in that bin, while $\langle w\rangle$ in magnitude
bins can only be translated into redshift information with a parameterised model
of the redshift distribution. On the other hand, apparent magnitudes are easier
to measure than surface brightness and are much less affected by seeing.
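Since the expected image ellipticity is linear in $\langle w\rangle$, the estimate of eq. (5.39) can be read as a least-squares fit of the measured ellipticities to the reconstructed shear. A minimal sketch on synthetic data follows; the shear amplitude, the noise level, the sample size and the function name are all assumptions chosen for illustration, not values from any of the surveys cited.

```python
import numpy as np

def estimate_mean_w(eps, gamma):
    """Least-squares estimate of <w> from the model <eps_i> = <w> * gamma(theta_i)."""
    eps = np.asarray(eps, dtype=complex)
    gamma = np.asarray(gamma, dtype=complex)
    return np.sum((gamma * np.conj(eps)).real) / np.sum(np.abs(gamma) ** 2)

# Synthetic sub-sample: weak shear of amplitude 0.05 with random orientation,
# plus intrinsic ellipticity noise of 0.2 per component (both hypothetical).
rng = np.random.default_rng(1)
n_gal = 200_000
gamma = 0.05 * np.exp(2j * rng.uniform(0.0, np.pi, n_gal))
true_w = 0.6
noise = rng.normal(0.0, 0.2, n_gal) + 1j * rng.normal(0.0, 0.2, n_gal)
eps = true_w * gamma + noise

w_hat = estimate_mean_w(eps, gamma)
print(f"estimated <w> = {w_hat:.3f} (input {true_w})")
```

The statistical error of such an estimate scales as the intrinsic ellipticity dispersion divided by the root of $\sum_i |\gamma(\vec\theta_i)|^2$, so large galaxy samples are needed when the shear is weak.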

Even if a cluster without strong lensing features is considered, the two methods
just described can be applied. The mass reconstruction then gives the mass
distribution up to an overall multiplicative constant. We assume here that the
mass-sheet degeneracy can be lifted, either using the magnification effect as
described in Sect. 5.4, or by extending the observations to sufficiently large
distances so that $\kappa \approx 0$ near the boundary of the data field. The mass
scale can then be fixed by considering the brightest sub-sample of galaxy images
for which a shear signal is detected if they are sufficiently bright for their
redshift probability distribution to be known from spectroscopic redshift surveys
(Bartelmann & Narayan 1995).

Whereas these methods have not yet been applied rigorously, there is one
observational result which indicates that the faint galaxy population has a
relatively high median redshift. In a sequence of clusters with increasing
redshift, more and more of the faint galaxies will lie in the foreground or very
close behind the cluster and therefore be unlensed. The dependence of the
observed lensing strength of clusters on their redshift can thus be used as a
rough indication of the median redshift of the faint galaxies. This idea was put
forward by Smail et al. (1994), who observed three clusters with redshifts
z = 0.26, z = 0.55 and z = 0.89. In the two lower-redshift clusters, a
significant weak lensing signal was detected, but no significant signal in the
high-redshift cluster. From the detection, models for the redshift distribution
of faint galaxies with $I < 25$ which predict a large fraction to be dwarf
galaxies at low redshift can be ruled out. The non-detection in the high-redshift
cluster cannot easily be interpreted since little information (e.g., from X-ray
maps) is available for this cluster, and thus the absence of a lensing signal may
be due to the cluster not being massive enough.

However, the detection of a strong shear signal in the cluster MS 1054-03 at
z = 0.83 (Luppino & Kaiser 1997) implies that a large fraction of galaxies with
$I < 25.5$ must lie at redshifts larger than $z \sim 1.5$. They split their galaxy
sample into red and blue sub-samples, as well as into brighter and fainter
sub-samples, and found that the shear signal is mainly due to the fainter and the
blue galaxies. If all the faint blue galaxies have a redshift
$z_\mathrm{s} = 1.5$, the mass-to-light ratio of this cluster is estimated to be
$M/L \sim 580\,h$, and if they all lie at redshift $z_\mathrm{s} = 1$, $M/L$
exceeds $\sim 1000\,h$. This observational result, which is complemented by
several additional shear detections in high-redshift clusters, one of them at
z = 0.82 (G. Luppino, private communication), provides the strongest evidence for
the high-redshift population of faint galaxies. In addition, it strongly
constrains cosmological models; an $\Omega_0 = 1$ cosmological model predicts the
formation of massive clusters only at relatively low redshifts (e.g., Richstone
et al. 1992; Bartelmann et al. 1993) and has difficulty explaining the presence
of strong-lensing clusters at redshift $z \sim 0.8$.

Recently, Lombardi & Bertin (1999) and Gautret et al. (1998) suggested that weak
lensing by galaxy clusters can be used to constrain the cosmological parameters
$\Omega_0$ and $\Omega_\Lambda$. Both methods assume that the redshift of
background galaxies can be estimated, e.g. with sufficiently precise
photometric-redshift techniques. Owing to the dependence of the lensing strength
on the angular-diameter distance ratio $D_\mathrm{ds}/D_\mathrm{s}$, sufficiently
detailed knowledge of the mass distribution in the lens and of the source
redshifts can be employed to constrain these cosmological parameters. Such a
determination through purely geometrical methods would be very valuable, although
the observational requirements for applying these methods appear fairly demanding
at present.
6 Weak Cosmological Lensing



In this section, we review how weak density perturbations in otherwise homoge- 
neous and isotropic Friedmann-Lemaitre model universes affect the propagation 
of light. We first describe how light propagates in the homogeneous and isotropic 
background models, and then discuss how local density inhomogeneities can be 
taken into account. The result is a propagation equation for the transverse separa- 
tion between the light rays of a thin light bundle. 

The solution of this equation leads to the deflection angle $\vec\alpha$ of weakly
deflected light rays. In close analogy to the thin-lens situation, half the
divergence of the deflection angle can be identified with an effective
surface-mass density $\kappa_\mathrm{eff}$. The power spectrum of
$\kappa_\mathrm{eff}$ is closely related to the power spectrum of the matter
fluctuations, and it forms the central physical object of the further discussion.
Any two-point statistics of cosmic magnification and cosmic shear can then be
expressed in a fairly simple manner in terms of the effective-convergence power
spectrum.

We discuss several applications, among which are the uncertainty in brightness de- 
terminations of cosmologically distant objects due to cosmic magnification, and 
several measures for cosmic shear, one of which is particularly suited for deter- 
mining the effective-convergence power spectrum. At the end of this chapter, we 
turn to higher-order statistical measures of cosmic lensing effects, which reflect the 
non-Gaussian nature of the non-linearly evolved density perturbations. 

When we give numerical examples, we generally employ four different model
universes. All have the CDM power spectrum for density fluctuations, but different
values for the cosmological parameters. They are summarised in Tab. 1. We choose
two Einstein-de Sitter models, SCDM and σCDM, normalised either to the local
abundance of rich clusters or to $\sigma_8 = 1$, respectively, and two low-density
models, OCDM and ΛCDM, which are cluster normalised and either open or spatially
flat, respectively.

Table 1

Cosmological models and their parameters used for numerical examples

Model   $\Omega_0$   $\Omega_\Lambda$   $h$   Normalisation   $\sigma_8$
SCDM    1.0          0.0                0.5   cluster         0.5
σCDM    1.0          0.0                0.5   $\sigma_8$      1.0
OCDM    0.3          0.0                0.7   cluster         0.85
ΛCDM    0.3          0.7                0.7   cluster         0.9

Light propagation in inhomogeneous model universes has been the subject of
numerous studies. Among them are Zel'dovich (1964), Dashevskii et al. (1965),
Kristian & Sachs (1966), Gunn (1967), Jaroszynski et al. (1990),
Babul & Lee (1991), Bartelmann & Schneider (1991), Blandford et al. (1991),
Miralda-Escudé (1991a), and Kaiser (1992). Non-linear effects were included
analytically by Jain & Seljak (1997), who also considered statistical effects of
higher than second order, as did Bernardeau et al. (1997). A particularly suitable
measure for cosmic shear was introduced by Schneider et al. (1998a).



6.1 Light Propagation; Choice of Coordinates 



As outlined in Sect. 3.2.1 (page 52), the governing equation for the propagation of
thin light bundles through arbitrary space times is the equation of geodesic
deviation (e.g. Misner et al. 1973, § 11; Schneider et al. 1992, § 3.5), or Jacobi
equation (3.23, page 53). This equation implies that the transverse physical
separation $\vec\xi$ between neighbouring rays in a thin light bundle is described
by the second-order differential equation

$$ \frac{\mathrm{d}^2\vec\xi}{\mathrm{d}\lambda^2} = \mathcal{T}\,\vec\xi , \tag{6.1} $$

where $\mathcal{T}$ is the optical tidal matrix (3.25, page 53) which describes the
influence of space-time curvature on the propagation of light. The affine
parameter $\lambda$ has to be chosen such that it locally reproduces the proper
distance and increases with decreasing time, hence
$\mathrm{d}\lambda = -c\,a\,\mathrm{d}t$. The elements of the matrix $\mathcal{T}$
then have the dimension [length]$^{-2}$.

We already discussed in Sect. 3.2.1 that the optical tidal matrix is proportional to
the unit matrix in a Friedmann-Lemaitre universe,

$$ \mathcal{T} = \mathcal{R}\,\mathcal{I} , \tag{6.2} $$

where the factor $\mathcal{R}$ is determined by the Ricci tensor as in eq. (3.26,
page 53). For a model universe filled with a perfect pressure-less fluid,
$\mathcal{R}$ can be written in the form (3.28, page 54).

It will prove convenient for the following discussion to replace the affine
parameter $\lambda$ in eq. (6.1) by the comoving distance $w$, which was defined in
eq. (2.3, page 13). This can be achieved using eqs. (3.31) and (3.32) together
with the definition of Hubble's parameter, $H(a) = \dot a\,a^{-1}$. Additionally,
we introduce the comoving separation vector $\vec x = a^{-1}\vec\xi$. These
substitutions leave the propagation equation (6.1) in the exceptionally simple
form

$$ \frac{\mathrm{d}^2\vec x}{\mathrm{d}w^2} + K\,\vec x = 0 , \tag{6.3} $$

where $K$ is the spatial curvature given in eq. (2.30, page 19). Equation (6.3) has
the form of an oscillator equation, hence its solutions are trigonometric or
hyperbolic functions, depending on whether $K$ is positive or negative. In the
special case of spatial flatness, $K = 0$, the comoving separation between light
rays is a linear function of distance.
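The three families of solutions can be written down and checked numerically. The sketch below uses the standard solution that vanishes at $w = 0$ with unit slope, i.e. $K^{-1/2}\sin(K^{1/2}w)$, $w$, or $(-K)^{-1/2}\sinh[(-K)^{1/2}w]$; the finite-difference check, the helper names and the chosen curvature values are assumptions made for illustration.

```python
import math

def f_K(w: float, K: float) -> float:
    """Solution of x'' + K x = 0 with x(0) = 0 and x'(0) = 1."""
    if K > 0.0:
        s = math.sqrt(K)
        return math.sin(s * w) / s
    if K < 0.0:
        s = math.sqrt(-K)
        return math.sinh(s * w) / s
    return w  # spatially flat case: linear in distance

def residual(w: float, K: float, h: float = 1.0e-3) -> float:
    """Finite-difference estimate of x'' + K x, which should vanish."""
    second = (f_K(w + h, K) - 2.0 * f_K(w, K) + f_K(w - h, K)) / h**2
    return second + K * f_K(w, K)

for K in (1.5, 0.0, -1.5):  # curvature in arbitrary inverse-length-squared units
    print(K, f_K(0.7, K), residual(0.7, K))
```

Positive curvature makes neighbouring rays reconverge, negative curvature makes them diverge faster than linearly, and in all three cases the residual of the oscillator equation is zero up to discretisation error.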



6.2 Light Deflection 



We now proceed by introducing density perturbations into the propagation equation
(6.3). We assume throughout that the Newtonian potential $\Phi$ of these
inhomogeneities is small, $|\Phi| \ll c^2$, that they move with velocities much
smaller than the speed of light, and that they are localised, i.e. that the
typical scales over which $\Phi$ changes appreciably are much smaller than the
curvature scale of the background Friedmann-Lemaitre model. Then, there exists a
local neighbourhood around each density perturbation which is large enough to
contain the perturbation completely and still small enough to be considered flat.
Under these circumstances, the metric is well approximated by the first
post-Newtonian order of the Minkowski metric (3.36, page 56). It then follows from
eq. (3.36) that the effective local index of refraction in the neighbourhood of
the perturbation is

$$ \frac{\mathrm{d}l}{\mathrm{d}t} = n = 1 - \frac{2\Phi}{c^2} . \tag{6.4} $$

Fermat's principle (e.g. Blandford & Narayan 1986; Schneider 1985) demands that
the light travel time along actual light paths is stationary, hence the variation
of $\int n\,\mathrm{d}l$ must vanish. This condition implies that light rays are
deflected locally according to

$$ \frac{\mathrm{d}^2\vec x}{\mathrm{d}w^2} = -\frac{2}{c^2}\,\nabla_\perp\Phi . \tag{6.5} $$

In weakly perturbed Minkowski space, this equation describes how an actual light
ray is curved away from a straight line in unperturbed Minkowski space. It is
therefore appropriate for describing light propagation through e.g. the Solar
system and other well-localised mass inhomogeneities.
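As a consistency check in the Minkowski limit, integrating eq. (6.5) once along a straight path past a point mass should reproduce the deflection $4GM/(c^2 b)$ for impact parameter $b$, i.e. about 1.75 arc seconds at the solar limb. The sketch below does this numerically; the solar constants are standard reference values, while the integration grid, its extent and the function name are assumptions chosen for illustration.

```python
import numpy as np

GM_SUN = 1.32712440018e20  # gravitational parameter of the Sun in m^3 s^-2
R_SUN = 6.957e8            # nominal solar radius in m
C = 299_792_458.0          # speed of light in m/s

def deflection_angle(gm: float, b: float, n: int = 200_001, extent: float = 1.0e4) -> float:
    """Integrate (2/c^2) |grad_perp Phi| along a straight ray with impact parameter b."""
    # Non-uniform grid z = b tan(t) covering |z| <= extent * b
    t = np.linspace(-np.arctan(extent), np.arctan(extent), n)
    z = b * np.tan(t)
    grad_perp = gm * b / (b**2 + z**2) ** 1.5  # for the point-mass potential Phi = -GM/r
    integrand = 2.0 / C**2 * grad_perp
    # Trapezoidal rule on the non-uniform grid
    return float(np.sum(0.5 * (integrand[1:] + integrand[:-1]) * np.diff(z)))

alpha = deflection_angle(GM_SUN, R_SUN)
alpha_arcsec = np.degrees(alpha) * 3600.0
print(f"deflection at the solar limb: {alpha_arcsec:.3f} arcsec")
```

The numerical result agrees with the analytic value $4GM/(c^2 b)$ to high accuracy, which confirms the factor of two in eq. (6.5) relative to the Newtonian expectation.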

This interpretation needs to be generalised for large-scale mass inhomogeneities
embedded in an expanding cosmological background, since the meaning of a
"straight" fiducial ray is then no longer obvious. In general, any physical
fiducial ray will also be deflected by potential gradients along its way. We can,
however, interpret $\vec x$ as the comoving separation vector between an
arbitrarily chosen fiducial light ray and a closely neighbouring light ray. The
right-hand side of eq. (6.5) must then contain the difference
$\Delta(\nabla_\perp\Phi)$ of the perpendicular potential gradients between the
two rays to account for the relative deflection of the two rays.

Let us therefore imagine a fiducial ray starting at the observer ($w = 0$) into
direction $\vec\theta = 0$, and a neighbouring ray starting at the same point but
into direction $\vec\theta \ne 0$. Let further $\vec x(\vec\theta, w)$ describe the
comoving separation between these two light rays at comoving distance $w$.
Combining the cosmological contribution given in eq. (6.3) with the modified local
contribution (6.5) leads to the propagation equation

$$ \frac{\mathrm{d}^2\vec x}{\mathrm{d}w^2} + K\,\vec x = -\frac{2}{c^2}\,\Delta\left\{\nabla_\perp\Phi\left[\vec x(\vec\theta, w), w\right]\right\} . \tag{6.6} $$

The notation on the right-hand side indicates that the difference of the
perpendicular potential gradients has to be evaluated between the two light rays
which have comoving separation $\vec x(\vec\theta, w)$ at comoving distance $w$
from the observer.

Linearising the right-hand side of eq. (6.6) in $\vec x$ immediately returns the
geodesic deviation equation (6.1) with the full optical tidal matrix, which
combines the homogeneous cosmological contribution (3.28, page 54) with the
contributions of local perturbations (3.37, page 56).

Strictly speaking, the comoving distance $w$, or the affine parameter $\lambda$,
are changed in the presence of density perturbations. Here, we assume that the
global properties of the weakly perturbed Friedmann-Lemaitre models remain the
same as in the homogeneous and isotropic case, and under this assumption the
comoving distance $w$ remains the same as in the unperturbed model.

To solve eq. (6.6), we first construct a Green's function $G(w, w')$, which has to
be a suitable linear combination of either trigonometric or hyperbolic functions
since the homogeneous equation (6.6) is an oscillator equation. We further have to
specify two boundary conditions. According to the situation we have in mind, these
boundary conditions read

$$ \vec x = 0 , \quad \frac{\mathrm{d}\vec x}{\mathrm{d}w} = \vec\theta \tag{6.7} $$

at $w = 0$. The first condition states that the two light rays start from the same
point, so that their initial separation is zero, and the second condition
indicates that they set out into directions which differ by $\vec\theta$.

The Green's function is then uniquely determined by

$$ G(w, w') = \begin{cases} f_K(w - w') & \text{for } w > w' \\ 0 & \text{otherwise} \end{cases} , \tag{6.8} $$

with $f_K(w)$ given in eq. (2.4, page 14). As a function of distance $w$, the
comoving separation between the two light rays is thus

$$ \vec x(\vec\theta, w) = f_K(w)\,\vec\theta - \frac{2}{c^2} \int_0^w \mathrm{d}w'\, f_K(w - w')\, \Delta\left\{\nabla_\perp\Phi\left[\vec x(\vec\theta, w'), w'\right]\right\} . \tag{6.9} $$

The perpendicular gradients of the Newtonian potential are to be evaluated along
the true paths of the two light rays. In its exact form, eq. (6.9) is therefore
quite involved.
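The structure of the Green's-function solution can be illustrated with a prescribed source term. In the sketch below, one component of $(2/c^2)\,\Delta(\nabla_\perp\Phi)$ is replaced by a hypothetical function `source(w)` that does not depend on the ray separation, which is a simplification of eq. (6.9); for a constant source, the result can be compared with the closed-form solution of the driven oscillator equation. All names and numbers are illustrative.

```python
import numpy as np

def f_K(w, K):
    """Comoving angular-diameter distance for curvature K (sin, linear or sinh)."""
    w = np.asarray(w, dtype=float)
    if K > 0:
        return np.sin(np.sqrt(K) * w) / np.sqrt(K)
    if K < 0:
        return np.sinh(np.sqrt(-K) * w) / np.sqrt(-K)
    return w

def separation(theta, K, source, w_max=1.0, n=2001):
    """x(w) = f_K(w) theta - int_0^w dw' f_K(w - w') source(w'), one component."""
    w = np.linspace(0.0, w_max, n)
    x = np.empty(n)
    for i in range(n):
        wp = w[: i + 1]
        integrand = f_K(w[i] - wp, K) * source(wp)
        # Trapezoidal rule; the sum is empty (zero) for i = 0
        integral = np.sum(0.5 * (integrand[1:] + integrand[:-1]) * np.diff(wp))
        x[i] = f_K(w[i], K) * theta - integral
    return w, x

# Constant source s0 (hypothetical): exact solution of x'' + K x = -s0
theta, K, s0 = 0.01, 2.0, 0.003
w, x = separation(theta, K, lambda wp: s0 * np.ones_like(wp))
exact = theta * np.sin(np.sqrt(K) * w) / np.sqrt(K) - s0 * (1.0 - np.cos(np.sqrt(K) * w)) / K
print("largest deviation from the exact solution:", np.max(np.abs(x - exact)))
```

The first term is the separation of unperturbed rays and the integral accumulates the relative deflections, each propagated from $w'$ to $w$ with the factor $f_K(w - w')$.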



Assuming that the change of the comoving separation vector $\vec x$ between the two
actual rays due to light deflection is small compared to the comoving separation
of unperturbed rays,

$$ \frac{\left|\vec x(\vec\theta, w') - f_K(w')\,\vec\theta\right|}{\left|f_K(w')\,\vec\theta\right|} \ll 1 , \tag{6.10} $$

we can replace $\vec x(\vec\theta, w')$ by $f_K(w')\,\vec\theta$ in the integrand
to arrive at a much simpler expression which corresponds to the Born approximation
of small-angle scattering. The Born approximation allows us to replace the
difference of the perpendicular potential gradients with the perpendicular
gradient of the potential difference. Taking the potential difference then amounts
to adding a term to the potential which depends on the comoving distance $w'$ from
the observer only. For notational simplicity, we can therefore rename the
potential difference $\Delta\Phi$ between the two rays to $\Phi$.

It is an important consequence of the Born approximation that the Jacobian matrix
of the lens mapping (3.11, page 49; 6.28 below) remains symmetric even in the
case of cosmological weak lensing. In a general multiple lens-plane situation,
this is not the case (Schneider et al. 1992, chapter 9).

If the two light rays propagated through unperturbed space-time, their comoving
separation at distance $w$ would simply be
$\vec x'(\vec\theta, w) = f_K(w)\,\vec\theta$, which is the first term on the
right-hand side of eq. (6.9). The net deflection angle at distance $w$ between the
two rays is the difference between $\vec x'$ and $\vec x$, divided by the angular
diameter distance to $w$, hence

$$ \vec\alpha(\vec\theta, w) = \frac{f_K(w)\,\vec\theta - \vec x(\vec\theta, w)}{f_K(w)} = \frac{2}{c^2} \int_0^w \mathrm{d}w'\, \frac{f_K(w - w')}{f_K(w)}\, \nabla_\perp\Phi\left[f_K(w')\,\vec\theta, w'\right] . \tag{6.11} $$

Again, this is the deflection angle of a light ray that starts out at the observer
into direction $\vec\theta$ relative to a nearby fiducial ray. Absolute deflection
angles cannot be measured. All measurable effects of light deflection therefore
only depend on derivatives of the deflection angle (6.11), so that the choice of
the fiducial ray is irrelevant for practical purposes. For simplicity, we call
$\vec\alpha(\vec\theta, w)$ the deflection angle at distance $w$ of a light ray
starting into direction $\vec\theta$ on the observer's sky, bearing in mind that
it is the deflection angle relative to an arbitrarily chosen fiducial ray, so that
$\vec\alpha(\vec\theta, w)$ is far from unique.

In an Einstein-de Sitter universe, $f_K(w) = w$. Defining $y = w'/w$, eq. (6.11)
simplifies to

$$ \vec\alpha(\vec\theta, w) = \frac{2w}{c^2} \int_0^1 \mathrm{d}y\, (1 - y)\, \nabla_\perp\Phi(w y\,\vec\theta, w y) . \tag{6.12} $$

Clearly, the deflection angle $\vec\alpha$ depends on the direction $\vec\theta$ on
the sky into which the light rays start to propagate, and on the comoving distance
$w$ to the sources.

Recall the various approximations adopted in the derivation of eq. (6.11): (i) The
density perturbations are well localised in an otherwise homogeneous and isotropic
background, i.e. each perturbation can be surrounded by a spatially flat
neighbourhood which can be chosen small compared to the curvature radius of the
background model, and yet large enough to encompass the entire perturbation. In
other words, the largest scale on which the density fluctuation spectrum
$P_\delta(k)$ has appreciable power must be much smaller than the Hubble radius
$c/H_0$. (ii) The Newtonian potential of the perturbations is small,
$|\Phi| \ll c^2$, and typical velocities are much smaller than the speed of light.
(iii) Relative deflection angles between neighbouring light rays are small enough
so that the difference of the transverse potential gradient can be evaluated at
the unperturbed path separation $f_K(w)\,\vec\theta$ rather than the actual one.
Reassuringly, these approximations are very comfortably satisfied even under
fairly extreme conditions. The curvature radius of the Universe is of order
$c H_0^{-1} = 3000\,h^{-1}$ Mpc and therefore much larger than perturbations of
even several tens of Mpc in size. Typical velocities in galaxy clusters are of
order $10^3\,\mathrm{km\,s^{-1}}$, much smaller than the speed of light, and
typical Newtonian potentials are of order $|\Phi| \lesssim 10^{-5}\,c^2$.


6.3 Effective Convergence 



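6.3.1 Definition and Derivation

In the thin-lens approximation, convergence $\kappa$ and deflection angle
$\vec\alpha$ are related by

$$ \kappa(\vec\theta) = \frac{1}{2}\,\nabla_\theta\cdot\vec\alpha(\vec\theta) = \frac{1}{2}\,\frac{\partial\alpha_i(\vec\theta)}{\partial\theta_i} , \tag{6.13} $$

where summation over $i$ is implied. In exact analogy, an effective convergence
$\kappa_\mathrm{eff}(\vec\theta, w)$ can be defined for cosmological weak lensing,

$$ \kappa_\mathrm{eff}(\vec\theta, w) = \frac{1}{2}\,\nabla_\theta\cdot\vec\alpha(\vec\theta, w) = \frac{1}{c^2} \int_0^w \mathrm{d}w'\, \frac{f_K(w - w')\,f_K(w')}{f_K(w)}\, \frac{\partial^2}{\partial x_i\,\partial x_i}\Phi\left[f_K(w')\,\vec\theta, w'\right] . \tag{6.14} $$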
Had we not replaced $\vec x(\vec\theta, w')$ by $f_K(w')\,\vec\theta$ following
eq. (6.9), eq. (6.14) would have contained second and higher-order terms in the
potential derivatives. Since eq. (6.9) is a Volterra integral equation of the
second kind, its solution (and derivatives thereof) can be expanded in a series,
of which the foregoing expression for $\kappa_\mathrm{eff}$ is the first term.
Equation (6.16) below shows that this term is of the order of the line-of-sight
average of the density contrast $\delta$. The next higher-order term, explicitly
written down in the Appendix of Schneider et al. (1998a), is determined by the
product $\delta(w')\,\delta(w'')$, averaged along the line-of-sight over
$w' < w''$. Analogous estimates apply to higher-order terms. Whereas the density
contrast may be large for individual density perturbations passed by a light ray,
the average of $\delta$ is small compared to unity for most rays, hence
$\kappa_\mathrm{eff} \ll 1$, and higher-order terms are accordingly negligible.

The effective convergence $\kappa_\mathrm{eff}$ in eq. (6.14) involves the
two-dimensional Laplacian of the potential. We can augment it by
$(\partial^2\Phi/\partial x_3^2)$ which involves only derivatives along the light
path, because these average to zero in the limit to which we are working. The
three-dimensional Laplacian of the potential can then be replaced by the density
contrast via Poisson's equation (2.65, page 32),

$$ \Delta\Phi = \frac{3H_0^2\,\Omega_0}{2a}\,\delta . \tag{6.15} $$

Hence, we find for the effective convergence,

$$ \kappa_\mathrm{eff}(\vec\theta, w) = \frac{3H_0^2\,\Omega_0}{2c^2} \int_0^w \mathrm{d}w'\, \frac{f_K(w')\,f_K(w - w')}{f_K(w)}\, \frac{\delta\left[f_K(w')\,\vec\theta, w'\right]}{a(w')} . \tag{6.16} $$

The effective convergence along a light ray is therefore an integral over the
density contrast along the (unperturbed) light path, weighted by a combination of
comoving angular-diameter distance factors, and the scale factor $a$. The
amplitude of $\kappa_\mathrm{eff}$ is proportional to the cosmic density parameter
$\Omega_0$.
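To get a feeling for the size of the effective convergence, eq. (6.16) can be evaluated for a toy density field. The sketch below assumes an Einstein-de Sitter universe, in which $f_K(w) = w$, $w(z) = (2c/H_0)[1 - (1+z)^{-1/2}]$ and $a(w) = [1 - H_0 w/(2c)]^2$, and a hypothetical slab of density contrast $\delta = 1$ that is $100\,h^{-1}$ Mpc thick; the slab, the midpoint-rule integration and all names are illustrative assumptions.

```python
import numpy as np

C_OVER_H0 = 2997.92458  # Hubble radius c/H0 in h^-1 Mpc

def eds_comoving_distance(z):
    """Comoving distance in an Einstein-de Sitter universe, in h^-1 Mpc."""
    return 2.0 * C_OVER_H0 * (1.0 - 1.0 / np.sqrt(1.0 + z))

def eds_scale_factor(w):
    """Scale factor at comoving distance w (Einstein-de Sitter)."""
    return (1.0 - w / (2.0 * C_OVER_H0)) ** 2

def lensing_kernel(w_prime, w_s, omega0=1.0):
    """Weight multiplying delta in eq. (6.16) for K = 0, per h^-1 Mpc."""
    geom = w_prime * (w_s - w_prime) / w_s
    return 1.5 * omega0 / C_OVER_H0**2 * geom / eds_scale_factor(w_prime)

def kappa_eff(delta_of_w, w_s, n=20_000):
    """Midpoint-rule integral of kernel * delta along the unperturbed path."""
    h = w_s / n
    wp = (np.arange(n) + 0.5) * h
    return float(np.sum(lensing_kernel(wp, w_s) * delta_of_w(wp)) * h)

w_s = eds_comoving_distance(1.0)  # sources at redshift 1
slab = lambda w: np.where((w > 750.0) & (w < 850.0), 1.0, 0.0)
print(f"kappa_eff for the toy slab: {kappa_eff(slab, w_s):.4f}")
```

Even this rather extreme toy structure yields an effective convergence of only about one per cent, in line with the statement that $\kappa_\mathrm{eff} \ll 1$ for most rays; the kernel vanishes at the observer and at the source and is largest roughly half-way between them.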

Expression (6.16) gives the effective convergence for a fixed source redshift
corresponding to the comoving source distance $w$. When the sources are
distributed in comoving distance, $\kappa_\mathrm{eff}(\vec\theta, w)$ needs to be
averaged over the (normalised) source-distance distribution $G(w)$,

$$ \bar\kappa_\mathrm{eff}(\vec\theta) = \int_0^{w_\mathrm{H}} \mathrm{d}w\, G(w)\, \kappa_\mathrm{eff}(\vec\theta, w) , \tag{6.17} $$

where $G(w)\,\mathrm{d}w = p_z(z)\,\mathrm{d}z$. Suitably re-arranging the
integration limits, we can then write the source-distance weighted effective
convergence as

$$ \bar\kappa_\mathrm{eff}(\vec\theta) = \frac{3H_0^2\,\Omega_0}{2c^2} \int_0^{w_\mathrm{H}} \mathrm{d}w\, W(w)\, f_K(w)\, \frac{\delta\left[f_K(w)\,\vec\theta, w\right]}{a(w)} , \tag{6.18} $$

where the weighting function $W(w)$ is now

$$ W(w) = \int_w^{w_\mathrm{H}} \mathrm{d}w'\, G(w')\, \frac{f_K(w' - w)}{f_K(w')} . \tag{6.19} $$

The upper integration boundary $w_\mathrm{H}$ is the horizon distance, defined as
the comoving distance obtained for infinite redshift. In fact, it is easily shown
that the effective convergence can be written as

$$ \bar\kappa_\mathrm{eff} = \int \mathrm{d}z\, \frac{4\pi G}{c^2}\, \frac{D_\mathrm{d}\,D_\mathrm{ds}}{D_\mathrm{s}}\, \frac{\mathrm{d}D_\mathrm{prop}}{\mathrm{d}z}\, (\rho - \bar\rho) , \tag{6.20} $$

and the weighting function $W$ is the distance ratio
$\langle D_\mathrm{ds}/D_\mathrm{s}\rangle$, averaged over the source distances at
fixed lens distance. Naively generalising the definition of the dimension-less
surface-mass density (3.7, page 48) to a three-dimensional matter distribution
would therefore directly have led to the cosmologically correct expression for
the effective convergence.
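The weighting function of eq. (6.19) is straightforward to tabulate for a given source-distance distribution. The sketch below assumes spatial flatness, so that $f_K(w) = w$, distances in arbitrary units, and a hypothetical narrow Gaussian $G(w)$ centred on $w_\mathrm{s} = 1$; for such a distribution, $W(w)$ should be close to $1 - w/w_\mathrm{s}$ in front of the sources and vanish behind them.

```python
import numpy as np

def weight_function(w, w_grid, G):
    """W(w) = int_w^{w_H} dw' G(w') (w' - w) / w' for K = 0, trapezoidal rule."""
    mask = w_grid > w
    wp, g = w_grid[mask], G[mask]
    integrand = g * (wp - w) / wp
    return float(np.sum(0.5 * (integrand[1:] + integrand[:-1]) * np.diff(wp)))

# Hypothetical source distribution: narrow Gaussian around w_s = 1
w_grid = np.linspace(1.0e-3, 2.0, 20_001)
G = np.exp(-0.5 * ((w_grid - 1.0) / 0.01) ** 2)
G /= np.sum(0.5 * (G[1:] + G[:-1]) * np.diff(w_grid))  # normalise to unit integral

for w in (0.0, 0.4, 0.8, 1.5):
    print(w, round(weight_function(w, w_grid, G), 4))
```

Broader source distributions smooth the sharp cut-off at the source distance, but $W$ remains a monotonically decreasing function of the lens distance.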



6.4 Effective-Convergence Power Spectrum 



6.4.1 The Power Spectrum from Limber's Equation

Here, we are interested in the statistical properties of the effective convergence
$\bar\kappa_\mathrm{eff}$, especially its power spectrum $P_\kappa(l)$. We refer
the reader to Sect. 2.4 (page 41) for the definition of the power spectrum. We
also note that the expression for $\bar\kappa_\mathrm{eff}(\vec\theta)$ is of the
form (2.77, page 43), and so the power spectrum $P_\kappa(l)$ is given in terms of
$P_\delta(k)$ by eq. (2.84, page 44), if one sets

$$ q_1(w) = q_2(w) = \frac{3}{2}\,\frac{H_0^2}{c^2}\,\Omega_0\, W(w)\, \frac{f_K(w)}{a(w)} . \tag{6.21} $$

We therefore obtain

$$ P_\kappa(l) = \frac{9H_0^4\,\Omega_0^2}{4c^4} \int_0^{w_\mathrm{H}} \mathrm{d}w\, \frac{W^2(w)}{a^2(w)}\, P_\delta\!\left(\frac{l}{f_K(w)}, w\right) , \tag{6.22} $$

with the weighting function $W$ given in eq. (6.19). This power spectrum is the
central quantity for the discussion in the remainder of this chapter.
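The line-of-sight integral in eq. (6.22) is easy to evaluate numerically once the density power spectrum is specified. The sketch below is restricted to an Einstein-de Sitter universe with all sources at a single distance $w_\mathrm{s}$, so that $W(w) = 1 - w/w_\mathrm{s}$, and measures distances in units of $c/H_0$; the power-law toy spectrum $P_\delta(k, a) = a^2 k^{-1}$ is a hypothetical stand-in chosen only because the integral can then be done analytically, and it is not the CDM spectrum used for the figures.

```python
import numpy as np

def p_kappa(ell, w_s, p_delta, n=4000):
    """Eq. (6.22) for an EdS universe and a single source plane at w_s.

    Distances are in units of c/H0, so the prefactor reduces to 9/4.
    p_delta(k, a) is a user-supplied density power spectrum (assumption).
    """
    h = w_s / n
    w = (np.arange(n) + 0.5) * h   # midpoints avoid the division by w = 0
    a = (1.0 - 0.5 * w) ** 2       # EdS scale factor at comoving distance w
    weight = 1.0 - w / w_s         # W(w) for sources at a single distance
    integrand = weight**2 / a**2 * p_delta(ell / w, a)
    return 2.25 * float(np.sum(integrand) * h)

# Toy spectrum: linear growth (proportional to a^2) times a power law in k
toy_spectrum = lambda k, a: a**2 / k

w_s = 2.0 * (1.0 - 1.0 / np.sqrt(2.0))  # sources at redshift 1
numeric = p_kappa(1000.0, w_s, toy_spectrum)
analytic = 2.25 * w_s**2 / (12.0 * 1000.0)
print(numeric, analytic)
```

For this toy spectrum the integrand is a cubic polynomial in $w$, and the numerical result reproduces the analytic value $(9/4)\,w_\mathrm{s}^2/(12\,l)$; a realistic application would replace `toy_spectrum` by a non-linearly evolved CDM spectrum.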

Figure 15 shows $P_\kappa(l)$ for five different realisations of the CDM
cosmogony. These are the four models whose parameters are detailed in Tab. 1, all
with non-linearly evolving density power spectrum $P_\delta$, using the
prescription of Peacock & Dodds (1996), plus the SCDM model with linearly evolving
$P_\delta$. Sources are assumed to be at redshift $z_\mathrm{s} = 1$. Curves 1 and
2 (solid and dotted; SCDM with linear and non-linear evolution, respectively)
illustrate the impact of non-linear density evolution in an Einstein-de Sitter
universe with cluster-normalised density fluctuations. Non-linear effects set in
on angular scales below a few times $10'$, and increase the amplitude of
$P_\kappa(l)$ by more than an order of magnitude on scales of $1'$. Curve 3
(short-dashed; σCDM), obtained for CDM normalised to $\sigma_8 = 1$ rather than
the cluster abundance, demonstrates the potential influence of different choices
for the power-spectrum normalisation. Curves 4 and 5 (dashed-dotted and
long-dashed; OCDM and ΛCDM, respectively) show $P_\kappa(l)$ for
cluster-normalised CDM in an open universe ($\Omega_0 = 0.3$,
$\Omega_\Lambda = 0$) and in a spatially flat, low-density universe
($\Omega_0 = 0.3$, $\Omega_\Lambda = 0.7$). It is a consequence of the
normalisation to the local cluster abundance that the various $P_\kappa(l)$ are
very similar for the different cosmologies on angular scales of a few arc
minutes. For the low-density universes, the difference between the cluster and
the $\sigma_8$ normalisation is substantially smaller than for the Einstein-de
Sitter model.

Figure 16 gives another representation of the curves in Fig. 15. There, we plot
$l^2 P_\kappa(l)$, i.e. the total power in the effective convergence per
logarithmic $l$ interval. This representation demonstrates that density
fluctuations on angular scales smaller than $\approx 10'$ contribute most strongly
to weak gravitational lensing by large-scale structures. On angular scales
smaller than $\approx 1'$, the curves level off and then decrease very gradually.
The solid curve in Fig. 16 shows that, when linear density evolution is assumed,
most power is contributed by structures on scales above $10'$, emphasising that it
is crucial to take non-linear evolution into account to avoid misleading
conclusions.



6.4.2 Special Cases 

In the approximation of linear density evolution, applicable on large angular
scales $\gtrsim 30'$, the density contrast grows in proportion with $a\,g(a)$, as
described following eq. (2.52) on page 26. The power spectrum of the density
contrast then evolves $\propto a^2 g^2(a)$. Inserting this into eq. (6.22), the
squared scale factor $a^2(w)$ cancels, and we find

$$ P_\kappa(l) = \frac{9H_0^4\,\Omega_0^2}{4c^4} \int_0^{w_\mathrm{H}} \mathrm{d}w\, g^2[a(w)]\, W^2(w)\, P_\delta^0\!\left(\frac{l}{f_K(w)}\right) . \tag{6.23} $$

Here, $P_\delta^0(k)$ is the density-contrast power spectrum linearly extrapolated
to the present epoch.

In an Einstein-de Sitter universe, the growth function $g(a)$ is unity since
$P_\delta$ grows like the squared scale factor. In that special case, the
expression for the power spectrum of $\bar\kappa_\mathrm{eff}$ further reduces to

$$ P_\kappa(l) = \frac{9H_0^4}{4c^4} \int_0^{w_\mathrm{H}} \mathrm{d}w\, W^2(w)\, P_\delta^0\!\left(\frac{l}{w}\right) , \tag{6.24} $$

and the weight function $W$ simplifies to

$$ W(w) = \int_w^{w_\mathrm{H}} \mathrm{d}w'\, G(w') \left(1 - \frac{w}{w'}\right) . \tag{6.25} $$

Fig. 15. Five effective-convergence power spectra $P_\kappa(l)$ are shown as
functions of the angular scale $2\pi l^{-1}$, expressed in arc minutes. All
sources were assumed to lie at $z_\mathrm{s} = 1$. The five curves represent the
four realisations of the CDM cosmogony listed in Tab. 1, all with non-linearly
evolving density-perturbation power spectra $P_\delta$, plus the SCDM model with
linearly evolving $P_\delta$. Solid curve (1): linearly evolving SCDM model;
dotted curve (2): non-linearly evolving SCDM; short-dashed curve (3): non-linearly
evolving σCDM; dashed-dotted and long-dashed curves (4 and 5): non-linearly
evolving OCDM and ΛCDM, respectively.

Fig. 16. Different representation of the curves in Fig. 15. We plot here
$l^2 P_\kappa(l)$, representing the total power in the effective convergence per
logarithmic $l$ interval. See the caption of Fig. 15 for the meaning of the
different line types. The figure demonstrates that the total power increases
monotonically towards small angular scales when non-linear evolution is taken
into account (i.e. with the exception of the solid curve). On angular scales
still smaller than $\approx 1'$, the curves level off and decrease very slowly.
This shows that weak lensing by cosmological mass distributions is mostly
sensitive to structures smaller than $\approx 10'$.

In some situations, the distance distribution of the sources can be approximated by
a delta peak at some distance $w_\mathrm{s}$,
$G(w) = \delta_\mathrm{D}(w - w_\mathrm{s})$. A typical example is weak lensing of
the Cosmic Microwave Background, where the source is the surface of last
scattering at redshift $z_\mathrm{s} \approx 1000$. Under such circumstances,

$$ W(w) = \left(1 - \frac{w}{w_\mathrm{s}}\right) \mathrm{H}(w_\mathrm{s} - w) , \tag{6.26} $$

where the Heaviside step function $\mathrm{H}(x)$ expresses the fact that sources
at $w_\mathrm{s}$ are only lensed by mass distributions at smaller distance $w$.
For this specific case, the effective-convergence power spectrum reads



$$P_\kappa(l) = \frac{9H_0^4}{4c^4}\,w_{\rm s}\int_0^1 dy\,(1-y)^2\,P_\delta^0\!\left(\frac{l}{w_{\rm s}y}\right), \tag{6.27}$$

where $y = w/w_{\rm s}$ is the distance ratio between lenses and sources. This equation
illustrates that all density-perturbation modes with wave numbers larger than
$k_{\rm min} = w_{\rm s}^{-1}l$, or equivalently with wavelengths smaller than $\lambda_{\rm max} = w_{\rm s}\theta$,
contribute to $P_\kappa(l)$. For example, the power spectrum of weak lensing on angular scales
of $\theta \approx 10'$ on sources at redshifts $z_{\rm s} \approx 2$ originates from all density perturbations
smaller than $\approx 7\,h^{-1}\,$Mpc. This result immediately illustrates the limitations of the
foregoing approximations. Density perturbations on scales smaller than a few Mpc become
non-linear even at moderate redshifts, and the assumption of linear evolution breaks
down.
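The numerical example above can be reproduced with a few lines of Python. This is only a sketch: it assumes an Einstein-de Sitter universe, for which the comoving distance is $w(z) = (2c/H_0)[1 - (1+z)^{-1/2}]$, and it expresses distances in $h^{-1}\,$Mpc by setting $H_0 = 100\,h\,$km/s/Mpc. The function names are ours and purely illustrative.

```python
import math

C_KM_S = 299792.458   # speed of light in km/s
H0 = 100.0            # Hubble constant in h km/s/Mpc, so distances come out in Mpc/h

def comoving_distance_eds(z):
    """Comoving distance w(z) in an Einstein-de Sitter universe, in Mpc/h."""
    return 2.0 * C_KM_S / H0 * (1.0 - 1.0 / math.sqrt(1.0 + z))

def lambda_max(theta_arcmin, z_source):
    """Largest contributing wavelength, lambda_max = w_s * theta, in Mpc/h."""
    theta_rad = math.radians(theta_arcmin / 60.0)
    return comoving_distance_eds(z_source) * theta_rad

print(f"w_s(z=2)   = {comoving_distance_eds(2.0):.0f} Mpc/h")
print(f"lambda_max = {lambda_max(10.0, 2.0):.1f} Mpc/h")
```

For $\theta = 10'$ and $z_{\rm s} = 2$ this gives a wavelength of slightly more than $7\,h^{-1}\,$Mpc, in line with the estimate quoted in the text.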



6.5 Magnification and Shear 



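In analogy to the Jacobian matrix $\mathcal{A}$ of the conventional lens equation (3.11,
page 49), we now form the matrix

$$\mathcal{A}(\vec\theta, w) = \mathcal{I} - \frac{\partial\vec\alpha(\vec\theta, w)}{\partial\vec\theta} = \frac{1}{f_K(w)}\,\frac{\partial\vec x(\vec\theta, w)}{\partial\vec\theta}. \tag{6.28}$$

The magnification is the inverse of the determinant of $\mathcal{A}$ (see eq. 3.14, page 49).
To first order in the perturbations, we obtain for the magnification of a source at
distance $w$ seen in direction $\vec\theta$

$$\mu(\vec\theta, w) = \frac{1}{\det\mathcal{A}(\vec\theta, w)} \approx 1 + \nabla_{\vec\theta}\cdot\vec\alpha(\vec\theta, w) = 1 + 2\kappa_{\rm eff}(\vec\theta, w) \equiv 1 + \delta\mu(\vec\theta, w). \tag{6.29}$$

In the weak-lensing approximation, the magnification fluctuation $\delta\mu$ is simply twice
the effective convergence $\kappa_{\rm eff}$, just as in the thin-lens approximation.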
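How good the first-order expression is can be judged from a small numerical comparison. The sketch below assumes the usual symmetric form of the Jacobian, with diagonal elements $1 - \kappa \mp \gamma_1$ and off-diagonal elements $-\gamma_2$; the sample values of $\kappa$ and $\gamma$ are arbitrary and chosen only for illustration.

```python
def magnification_exact(kappa, gamma1, gamma2):
    """1/det(A) for the symmetric Jacobian A = [[1-k-g1, -g2], [-g2, 1-k+g1]]."""
    det = (1.0 - kappa) ** 2 - (gamma1 ** 2 + gamma2 ** 2)
    return 1.0 / det

def magnification_first_order(kappa):
    """Weak-lensing approximation mu = 1 + 2*kappa, cf. eq. (6.29)."""
    return 1.0 + 2.0 * kappa

# Arbitrary illustrative values: the approximation degrades as kappa and gamma grow.
for kappa, gamma1, gamma2 in [(0.01, 0.01, 0.0), (0.05, 0.03, 0.02), (0.2, 0.1, 0.1)]:
    exact = magnification_exact(kappa, gamma1, gamma2)
    approx = magnification_first_order(kappa)
    print(f"kappa={kappa:.2f}  exact={exact:.4f}  first order={approx:.4f}  "
          f"difference={exact - approx:.4f}")
```

The difference is of second order in $\kappa$ and $|\gamma|$, so it is negligible at the per-cent level of convergence typical for cosmic shear.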

We emphasise again that the approximations made imply that the matrix $\mathcal{A}$ is
symmetric. In general, when higher-order terms in the Newtonian potential are
considered, $\mathcal{A}$ attains an asymmetric contribution. Jain et al. (1999) used ray-tracing
simulations through the density distribution of the Universe computed in very high
resolution $N$-body simulations to show that the symmetry of $\mathcal{A}$ is satisfied to very
high accuracy. Only for those light rays which happen to propagate close to more
than one strong deflector can the deviation from symmetry be appreciable. Further
estimates of the validity of the various approximations have been carried out
analytically by Bernardeau et al. (1997) and Schneider et al. (1998a).

Therefore, as in the single lens-plane situation, the anisotropic deformation, or
shear, of a light bundle is determined by the trace-free part of the matrix $\mathcal{A}$
(cf. eq. 3.11, page 49). As explained there, the shear makes elliptical images from
circular sources. Let $a$ and $b$ be the major and minor axes of the image ellipse of a
circular source, respectively; then the ellipticity is

$$|\chi| = \frac{a^2 - b^2}{a^2 + b^2} \approx 2|\gamma|, \tag{6.30}$$

where the latter approximation is valid for weak lensing, $|\gamma| \ll 1$; cf. eq. (4.18).
The quantity $2\gamma$ was sometimes called polarisation in the literature
(Blandford et al. 1991, Miralda-Escudé 1991a, Kaiser 1992).

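In the limit of weak lensing which is relevant here, the two-point statistical
properties of $\delta\mu$ and of $2\gamma$ are identical (e.g. Blandford et al. 1991). To see this, we
first note that the first derivatives of the deflection angle occurring in eq. (6.29) can be
written as second derivatives of an effective deflection potential $\psi$ which is defined
in terms of the effective surface mass density $\kappa_{\rm eff}$ in the same way as in the
single lens-plane case; see (3.9, page 48). We then imagine that $\delta\mu$ and $\gamma$ are Fourier
transformed, whereupon the derivatives with respect to $\theta_i$ are replaced by
multiplications with components of the wave vector $\vec l$ conjugate to $\vec\theta$. In Fourier space,
the expressions for the averaged quantities $\langle\delta\mu^2\rangle$ and $4\langle|\gamma|^2\rangle$ differ only by the
combinations of $l_1$ and $l_2$ which appear under the average. We have

$$\begin{aligned}
(l_1^2 + l_2^2)^2 &= |\vec l|^4 && \text{for}\quad \langle\delta\mu^2\rangle = 4\langle\kappa_{\rm eff}^2\rangle, \\
(l_1^2 - l_2^2)^2 + 4l_1^2l_2^2 &= |\vec l|^4 && \text{for}\quad 4\langle|\gamma|^2\rangle = 4\langle\gamma_1^2 + \gamma_2^2\rangle,
\end{aligned} \tag{6.31}$$

and hence the two-point statistical properties of $\delta\mu$ and $2\gamma$ agree identically.
Therefore, the power spectra of effective convergence and shear agree,

$$\langle\hat\kappa_{\rm eff}(\vec l)\,\hat\kappa_{\rm eff}^*(\vec l')\rangle = \langle\hat\gamma(\vec l)\,\hat\gamma^*(\vec l')\rangle \quad\Rightarrow\quad P_\kappa(l) = P_\gamma(l). \tag{6.32}$$

Thus we can concentrate on the statistics of either the magnification fluctuations or
the shear only. Since $\delta\mu = 2\kappa_{\rm eff}$, the magnification power spectrum is $4P_\kappa$, and
we can immediately employ the convergence power spectrum $P_\kappa$.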
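The equality of the convergence and shear power spectra can be verified mode by mode. The following sketch assumes the standard relations $\kappa = (\psi_{,11} + \psi_{,22})/2$, $\gamma_1 = (\psi_{,11} - \psi_{,22})/2$ and $\gamma_2 = \psi_{,12}$ between the effective potential and the lensing quantities; the random potential amplitudes and wave vectors are arbitrary test values.

```python
import random

def fourier_kappa_gamma(psi_hat, l1, l2):
    """Fourier amplitudes of kappa, gamma_1, gamma_2 for one potential mode psi_hat.

    Each derivative d/d(theta_i) becomes a factor i*l_i, so second derivatives
    become -l_i*l_j.
    """
    kappa_hat = -0.5 * (l1 * l1 + l2 * l2) * psi_hat
    gamma1_hat = -0.5 * (l1 * l1 - l2 * l2) * psi_hat
    gamma2_hat = -l1 * l2 * psi_hat
    return kappa_hat, gamma1_hat, gamma2_hat

def power_ratio(psi_hat, l1, l2):
    """|gamma_hat|^2 / |kappa_hat|^2 for a single mode; expected to be unity."""
    kappa_hat, gamma1_hat, gamma2_hat = fourier_kappa_gamma(psi_hat, l1, l2)
    return (abs(gamma1_hat) ** 2 + abs(gamma2_hat) ** 2) / abs(kappa_hat) ** 2

random.seed(0)
for _ in range(5):
    l1, l2 = random.uniform(-1e3, 1e3), random.uniform(-1e3, 1e3)
    psi_hat = complex(random.gauss(0.0, 1.0), random.gauss(0.0, 1.0))
    print(f"l = ({l1:8.1f}, {l2:8.1f})   ratio = {power_ratio(psi_hat, l1, l2):.12f}")
```

The ratio is unity for every mode up to rounding errors, which is the content of eq. (6.31).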

6.6 Second-Order Statistical Measures

We aim at the statistical properties of the magnification fluctuation and the shear. In
particular, we are interested in the amplitude of these quantities and their angular
coherence. Both can be described by their angular auto-correlation functions, or
other second-order statistical measures that will turn out to be more practical later.
As long as the density fluctuation field $\delta$ remains Gaussian, the probability
distributions of $\delta\mu$ and $\gamma$ are also Gaussians with mean zero, and two-point statistical
measures are sufficient for their complete statistical description. When non-linear
evolution of the density contrast sets in, non-Gaussianity develops, and higher-order
statistical measures become important.

6.6.1 Angular Auto-Correlation Function

The angular auto-correlation function $\xi_q(\phi)$ of some isotropic quantity $q(\vec\theta)$ is
the Fourier transform of the power spectrum $P_q(l)$ of $q(\vec\theta)$. In particular, the
auto-correlation function of the magnification fluctuation, $\xi_\mu(\phi)$, is related to the
effective-convergence power spectrum $P_\kappa(l)$ through

$$\begin{aligned}
\xi_\mu(\phi) &= \langle\delta\mu(\vec\theta)\,\delta\mu(\vec\theta + \vec\phi)\rangle = 4\langle\kappa_{\rm eff}(\vec\theta)\,\kappa_{\rm eff}(\vec\theta + \vec\phi)\rangle = 4\langle\gamma(\vec\theta)\,\gamma^*(\vec\theta + \vec\phi)\rangle \\
&= 4\int\frac{d^2l}{(2\pi)^2}\,P_\kappa(l)\,\exp(-i\,\vec l\cdot\vec\phi) = 4\int_0^\infty\frac{l\,dl}{2\pi}\,P_\kappa(l)\,J_0(l\phi),
\end{aligned} \tag{6.33}$$

where $\vec\phi$ is a vector with norm $\phi$. The factor four in front of the integral accounts for
the fact that $\delta\mu = 2\kappa_{\rm eff}$ in the weak-lensing approximation. For the last equality in
(6.33), we integrated over the angle enclosed by $\vec l$ and $\vec\phi$, leading to the zeroth-order
Bessel function of the first kind, $J_0(x)$. Equation (6.33) shows that the magnification
(or shear) auto-correlation function is an integral over the power spectrum of the
effective convergence $\kappa_{\rm eff}$, filtered by the Bessel function $J_0(x)$. Since the latter is
a broad-band filter, the magnification auto-correlation function is not well suited
for extracting information on $P_\kappa$. It would be desirable to replace $\xi_\mu(\phi)$ by another
measurable quantity which involves a narrow-band filter.
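The last integral in eq. (6.33) is straightforward to evaluate numerically. The sketch below does so for a toy Gaussian power spectrum, which is an assumption made purely so that the result can be compared with a closed-form expression; it is not a CDM prediction, and the constants `L_C` and `P_0` are hypothetical. The Bessel function is computed from Bessel's integral so that only the Python standard library is needed.

```python
import math

def bessel_j(n, x, steps=100):
    """J_n(x) from Bessel's integral (1/pi) int_0^pi cos(n t - x sin t) dt (trapezoid rule)."""
    h = math.pi / steps
    total = 0.5 * (1.0 + math.cos(n * math.pi))   # end points t = 0 and t = pi
    for i in range(1, steps):
        t = i * h
        total += math.cos(n * t - x * math.sin(t))
    return total * h / math.pi

L_C = 1000.0    # hypothetical cut-off multipole of the toy spectrum
P_0 = 1.0e-9    # hypothetical amplitude of the toy spectrum

def p_kappa_toy(l):
    """Toy convergence power spectrum, P_0 * exp(-l^2 / (2 L_C^2))."""
    return P_0 * math.exp(-0.5 * (l / L_C) ** 2)

def xi_mu(phi, p_kappa, l_max=8.0 * L_C, n=2000):
    """Eq. (6.33): xi_mu(phi) = 4 int_0^inf l dl / (2 pi) P_kappa(l) J_0(l phi)."""
    h = l_max / n
    total = 0.0
    for i in range(1, n):   # integrand vanishes at l = 0 and is negligible at l_max
        l = i * h
        total += l * p_kappa(l) * bessel_j(0, l * phi)
    return 4.0 * total * h / (2.0 * math.pi)

def xi_mu_exact(phi):
    """Closed form of the same integral for the Gaussian toy spectrum."""
    return 2.0 / math.pi * P_0 * L_C ** 2 * math.exp(-0.5 * (L_C * phi) ** 2)

for phi_arcmin in (0.0, 1.0, 3.0, 10.0):
    phi = math.radians(phi_arcmin / 60.0)
    print(f"phi = {phi_arcmin:4.1f}'  numerical = {xi_mu(phi, p_kappa_toy):.4e}  "
          f"closed form = {xi_mu_exact(phi):.4e}")
```

For a realistic $P_\kappa(l)$ one would replace `p_kappa_toy` by a tabulated or fitted spectrum and choose `l_max` large enough for the integral to converge.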

Nonetheless, inserting eq. (6.22) into eq. (6.33), we obtain the expression for the
magnification auto-correlation function,

$$\xi_\mu(\phi) = \frac{9H_0^4\Omega_0^2}{c^4}\int_0^{w_{\rm H}} dw\,f_K^2(w)\,\bar W^2(w)\,a^{-2}(w)\int_0^\infty\frac{k\,dk}{2\pi}\,P_\delta(k, w)\,J_0[f_K(w)k\phi]. \tag{6.34}$$

The magnification auto-correlation function therefore turns out to be an integral over
the density-fluctuation power spectrum weighted by a $k$-space window function
which selects the contributing density perturbation modes.

6.6.2 Special Cases and Qualitative Expectations 

In order to gain some insight into the expected behaviour of the magnification
auto-correlation function $\xi_\mu(\phi)$, we now make a number of simplifying assumptions.
Let us first specialise to linear density evolution in an Einstein-de Sitter universe,
and assume sources are at a single distance $w_{\rm s}$. Equation (6.34) then immediately
simplifies to

$$\xi_\mu(\phi) = \frac{9H_0^4}{c^4}\,w_{\rm s}^3\int_0^1 dy\,y^2(1-y)^2\int_0^\infty\frac{k\,dk}{2\pi}\,P_\delta^0(k)\,J_0(w_{\rm s}yk\phi), \tag{6.35}$$

with $y \equiv w_{\rm s}^{-1}w$.

We now introduce two model spectra $P_\delta^0(k)$, one of which has an exponential cut-off
above some wave number $k_0$, while the other falls off like $k^{-3}$ for $k > k_0$. For small
$k$, both spectra increase like $k$. They approximately describe two extreme cases of
popular cosmogonies, the HDM and the CDM model. We choose the functional
forms

$$P_{\delta,{\rm HDM}}^0 = A\,k\,\exp\left(-\frac{k}{k_0}\right), \qquad P_{\delta,{\rm CDM}}^0 = A\,k\,\frac{9k_0^4}{(k^2 + 3k_0^2)^2}, \tag{6.36}$$

where $A$ is the normalising amplitude of the power spectra. The numerical
coefficients in the CDM model spectrum are chosen such that both spectra peak at the
same wave number $k = k_0$. Inserting these model spectra into eq. (6.35), performing
the $k$ integration, and expanding the result in a power series in $\phi$, we obtain
(Bartelmann 1995b)

$$\begin{aligned}
\xi_{\mu,{\rm HDM}}(\phi) &= \frac{3A'}{10\pi}\,(w_{\rm s}k_0)^3 - \frac{9A'}{35\pi}\,(w_{\rm s}k_0)^5\,\phi^2 + O(\phi^4), \\
\xi_{\mu,{\rm CDM}}(\phi) &= \frac{9\sqrt{3}\,A'}{80}\,(w_{\rm s}k_0)^3 - \frac{27A'}{40\pi}\,(w_{\rm s}k_0)^4\,\phi + O(\phi^2),
\end{aligned} \tag{6.37}$$

where $A' = (H_0c^{-1})^4A$. We see from eq. (6.37) that the magnification correlation
function for the HDM spectrum is flat to first order in $\phi$, while it decreases linearly
with $\phi$ for the CDM spectrum. This demonstrates that the shape of the magnification
auto-correlation function $\xi_\mu(\phi)$ reflects the shape of the dark-matter power spectrum.
Motivated by the result of a large number of cosmological studies showing that
HDM models have the severe problem of structure on small scales forming at times
much later than observed (see e.g. Peacock 1999), we now neglect the HDM model
and focus on the CDM power spectrum only.

We can then expect $\xi_\mu(\phi)$ to increase linearly with decreasing $\phi$ as $\phi$ goes to zero.
Although we assumed linear evolution of the power spectrum to achieve this result, this
qualitative behaviour remains valid when non-linear evolution is assumed, because
the non-linear CDM power spectra also asymptotically fall off $\propto k^{-3}$ for large
wave numbers $k$.
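The stated properties of the two model spectra, a common peak at $k = k_0$, growth $\propto k$ at small $k$, and a $k^{-3}$ tail for the CDM form, are easy to confirm numerically. The sketch below uses units with $k_0 = A = 1$ and a brute-force grid scan; the helper names are ours.

```python
import math

def p_hdm(k, k0=1.0, amp=1.0):
    """HDM model spectrum: A k exp(-k/k0)."""
    return amp * k * math.exp(-k / k0)

def p_cdm(k, k0=1.0, amp=1.0):
    """CDM model spectrum: A k 9 k0^4 / (k^2 + 3 k0^2)^2."""
    return amp * k * 9.0 * k0 ** 4 / (k * k + 3.0 * k0 * k0) ** 2

def peak_location(spectrum, k_lo=0.0, k_hi=5.0, n=50000):
    """Wave number of the maximum on a uniform grid (brute-force scan)."""
    step = (k_hi - k_lo) / n
    grid = [k_lo + i * step for i in range(n + 1)]
    return max(grid, key=spectrum)

def log_slope(spectrum, k, eps=1.0e-4):
    """Numerical logarithmic slope d ln P / d ln k."""
    return (math.log(spectrum(k * (1.0 + eps))) - math.log(spectrum(k))) / math.log(1.0 + eps)

print("HDM peak at k/k0 =", round(peak_location(p_hdm), 3))
print("CDM peak at k/k0 =", round(peak_location(p_cdm), 3))
print("small-k slopes   =", round(log_slope(p_hdm, 1e-3), 3), round(log_slope(p_cdm, 1e-3), 3))
print("CDM large-k slope =", round(log_slope(p_cdm, 1e3), 3))
```

The slower, power-law decline of the CDM spectrum at large $k$ is what produces the term linear in $\phi$ in eq. (6.37).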

Although the model spectra (6.36) are of limited validity, we can extract some
useful information from the small-angle approximations given in eq. (6.37). First,
the correlation amplitude $\xi_\mu(0)$ scales with the comoving distance to the sources
$w_{\rm s}$ as $w_{\rm s}^3$. In the Einstein-de Sitter case, for which eq. (6.37) was derived,
$w_{\rm s} = (2c/H_0)\,[1 - (1 + z_{\rm s})^{-1/2}]$. For low source redshifts, $z_{\rm s} \ll 1$,
$w_{\rm s} \approx (c/H_0)\,z_{\rm s}$, so that $\xi_\mu(0) \propto z_{\rm s}^3$. For $z_{\rm s} \gg 1$, $w_{\rm s} \to (2c/H_0)$, and
$\xi_\mu(0)$ becomes independent of source redshift. For intermediate source redshifts,
progress can be made by defining $\zeta_{\rm s} \equiv \ln(z_{\rm s})$ and expanding $\ln w[\exp(\zeta_{\rm s})]$ in a
power series in $\zeta_{\rm s}$. The result is an approximate power-law expression,
$w(z_{\rm s}) \propto z_{\rm s}^\epsilon$, valid in the vicinity of the zero point of the expansion. The exponent
$\epsilon$ changes from $\approx 0.6$ at $z_{\rm s} \approx 1$ to $\approx 0.38$ at $z_{\rm s} \approx 3$.
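The local power-law exponent is simply the logarithmic derivative $\epsilon = d\ln w/d\ln z_{\rm s}$, which can be written down in closed form for the Einstein-de Sitter distance relation. The short sketch below evaluates it; distances are in units of $c/H_0$ and the function names are illustrative.

```python
def comoving_distance_eds(z):
    """w(z) in units of c/H0 for an Einstein-de Sitter universe."""
    return 2.0 * (1.0 - (1.0 + z) ** -0.5)

def local_exponent(z):
    """epsilon(z) = d ln w / d ln z, evaluated analytically."""
    dw_dz = (1.0 + z) ** -1.5
    return z * dw_dz / comoving_distance_eds(z)

for z in (0.01, 1.0, 3.0, 100.0):
    eps = local_exponent(z)
    print(f"z_s = {z:6.2f}   epsilon = {eps:.3f}   xi_mu(0) ~ z_s^{3.0 * eps:.2f}")
```

The exponent tends to unity for $z_{\rm s} \ll 1$ and to zero for $z_{\rm s} \gg 1$, reproducing the two limits discussed above, and since $\xi_\mu(0) \propto w_{\rm s}^3$ the amplitude scales as $z_{\rm s}^{3\epsilon}$, i.e. roughly as $z_{\rm s}^{1.8}$ near $z_{\rm s} = 1$.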

Second, typical source distances are of order $2\,$Gpc. Since $k_0$ is the wave number
corresponding to the horizon size when relativistic and non-relativistic matter had
equal densities, $k_0^{-1} = d_{\rm H}(a_{\rm eq}) \approx 12\,(\Omega_0h^2)^{-1}\,$Mpc. Therefore, $w_{\rm s}k_0 \approx 150$.
Typically, the spectral amplitude $A'$ ranges between $10^{-8}$ and $10^{-9}$. A rough estimate
for the correlation amplitude $\xi_\mu(0)$ thus ranges between $10^{-2}$ and $10^{-3}$ for 'typical'
source redshifts $z_{\rm s} \gtrsim 1$.

Third, an estimate for the angular scale $\phi_0$ of the magnification correlation is
obtained by determining the angle where $\xi_\mu(\phi)$ has dropped to half its maximum.
From the small-angle approximation (6.37), we find $\phi_0 = \pi\sqrt{3}\,(12\,w_{\rm s}k_0)^{-1}$.
Inserting as before $w_{\rm s}k_0 \approx 150$, we obtain $\phi_0 \approx 10'$, decreasing with increasing source
redshift.
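Both estimates follow directly from the CDM expansion in eq. (6.37): the zero-lag amplitude is its leading term, and $\phi_0$ is the angle at which the linear term has removed half of it. The sketch below evaluates them for $w_{\rm s}k_0 = 150$; the two values used for $A'$ are illustrative amplitudes bracketing the range considered here.

```python
import math

def xi_mu_zero_lag(a_prime, ws_k0):
    """Leading term of eq. (6.37) for the CDM model spectrum."""
    return 9.0 * math.sqrt(3.0) / 80.0 * a_prime * ws_k0 ** 3

def half_max_angle_arcmin(ws_k0):
    """phi_0 = pi sqrt(3) / (12 w_s k_0), converted from radians to arc minutes."""
    phi0_rad = math.pi * math.sqrt(3.0) / (12.0 * ws_k0)
    return math.degrees(phi0_rad) * 60.0

WS_K0 = 150.0   # dimensionless product w_s * k_0 adopted in the text
for a_prime in (1.0e-9, 1.0e-8):   # illustrative spectral amplitudes
    print(f"A' = {a_prime:.0e}   xi_mu(0) = {xi_mu_zero_lag(a_prime, WS_K0):.2e}")
print(f"phi_0 = {half_max_angle_arcmin(WS_K0):.1f} arcmin")
```

Because $\phi_0 \propto (w_{\rm s}k_0)^{-1}$, doubling the source distance halves the correlation angle, which is the decrease with source redshift noted above.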

Summarising, we expect $\xi_\mu(\phi)$ in a CDM universe to

(1) start at $10^{-2}$ to $10^{-3}$ at $\phi = 0$ for source redshifts $z_{\rm s} \approx 1$;

(2) decrease linearly for small $\phi$ on an angular scale of $\phi_0 \approx 10'$; and

(3) increase with source redshift roughly as $\propto z_{\rm s}^{1.8}$ around $z_{\rm s} = 1$.

6.6.3 Realistic Cases 

After this digression, we now return to realistic CDM power spectra normalised to
fit observational constraints. Some representative results are shown in Fig. 17 for
the model parameter sets listed in Tab. 1.

The figure shows that typical values for $\xi_\mu^{1/2}(\phi)$ in cluster-normalised CDM models
with non-linear density evolution are $\approx 6\%$ at $\phi \approx 1'$, quite independent of the
cosmological model. The effects of non-linear evolution are considerable: it
increases $\xi_\mu^{1/2}$ by factors of three to four. The uncertainty in the
normalisation is illustrated by the two curves for the Einstein-de Sitter model, one of
which was calculated with the cluster normalisation, the other one with the $\sigma_8 = 1$
normalisation, which yields about a factor of two larger results for $\xi_\mu^{1/2}$. For the
other cosmological models (OCDM and $\Lambda$CDM), the effects of different normalisations
(cluster vs. COBE) are substantially smaller.

Fig. 17. Four pairs of magnification auto-correlation functions are shown for the
cosmological model parameter sets listed in Tab. 1, and for an assumed source redshift
$z_{\rm s} = 1$. For each pair, plotted with the same line type, the curve with lower amplitude at
small angular scale was calculated assuming linear, and the other one non-linear density
evolution. Solid curves: SCDM; dotted curves: $\sigma$CDM; short-dashed curves: OCDM; and
long-dashed curves: $\Lambda$CDM. Non-linear evolution increases the amplitude of $\xi_\mu^{1/2}(\phi)$ on
small angular scales by factors of three to four. The results for the cluster-normalised
models differ fairly little. At $\phi \approx 1'$, $\xi_\mu^{1/2}(\phi) \approx 6\%$ for non-linear density
evolution. For the Einstein-de Sitter models, the difference between cluster and $\sigma_8 = 1$
normalisation amounts to about a factor of two in $\xi_\mu^{1/2}(\phi)$.

6.6.4 Application: Magnification Fluctuations 

At zero lag, the magnification auto-correlation function reads

$$\xi_\mu(0) = \langle[\mu(\vec\theta) - 1]^2\rangle \equiv \langle\delta\mu^2\rangle, \tag{6.38}$$

which is the variance of the magnification fluctuation $\delta\mu$. Consequently, the rms
magnification fluctuation is

$$\delta\mu_{\rm rms} = \langle\delta\mu^2\rangle^{1/2} = \xi_\mu^{1/2}(0). \tag{6.39}$$

Figure 18 shows $\delta\mu_{\rm rms}$ as a function of source redshift for four different realisations
of the CDM cosmogony. For cluster-normalised CDM models, the rms magnification
fluctuation is of order $\delta\mu_{\rm rms} \approx 20\%$ for sources at $z_{\rm s} \approx 2$, and increases to
$\delta\mu_{\rm rms} \approx 25\%$ for $z_{\rm s} \approx 3$.

Fig. 18. The rms magnification fluctuation $\delta\mu_{\rm rms}$ is shown as a function of source
redshift $z_{\rm s}$ for non-linearly evolving density fluctuations in the four different
realisations of the CDM cosmogony detailed in Tab. 1. Solid curve: SCDM; dotted curve:
$\sigma$CDM; short-dashed curve: OCDM; and long-dashed curve: $\Lambda$CDM. Except for the $\sigma$CDM
model, typical rms magnification fluctuations are of order 20% at $z_{\rm s} = 2$, and 25% for
$z_{\rm s} = 3$.

The results shown in Fig. 18 indicate that for any cosmological source, gravitational
lensing causes a statistical uncertainty of its brightness. In magnitudes, a
typical effect at $z_{\rm s} \approx 2$ is $\delta m \approx 2.5 \times \log(1.2) \approx 0.2$. This can be important for
e.g. high-redshift supernovae of type Ia, which are used as cosmological standard
candles. Their intrinsic magnitude scatter is of order $\delta m \approx 0.1$ to $0.2$ magnitudes
(e.g. Phillips 1993; Riess et al. 1995, 1996; Hamuy et al. 1996). Therefore, the
lensing-induced brightness fluctuation is comparable to the intrinsic uncertainty
at redshifts $z_{\rm s} \gtrsim 2$ (Frieman 1996; Wambsganss et al. 1997; Holz 1998;
Metcalf & Silk 1999).
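The conversion from an rms magnification fluctuation to a magnitude scatter is a one-line calculation. The sketch below carries it out and, as an additional assumption of ours that is not made in the text, combines the lensing-induced and intrinsic scatter in quadrature as if they were independent Gaussian errors.

```python
import math

def magnitude_scatter(delta_mu_rms):
    """Magnitude change for a fractional flux change: 2.5 log10(1 + delta_mu)."""
    return 2.5 * math.log10(1.0 + delta_mu_rms)

def combined_scatter(intrinsic_mag, lensing_mag):
    """Quadrature sum, assuming independent Gaussian contributions (our assumption)."""
    return math.hypot(intrinsic_mag, lensing_mag)

for z_source, delta_mu in ((2.0, 0.20), (3.0, 0.25)):
    dm_lens = magnitude_scatter(delta_mu)
    print(f"z_s = {z_source:.0f}: lensing scatter = {dm_lens:.2f} mag, "
          f"total with 0.15 mag intrinsic = {combined_scatter(0.15, dm_lens):.2f} mag")
```

With an intrinsic scatter of 0.15 magnitudes, taken here as a representative value within the quoted range, lensing at $z_{\rm s} \approx 2$ raises the total scatter to roughly a quarter of a magnitude.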

Since the magnification probability can be highly skewed, the most probable
observed flux of a high-redshift supernova can deviate from the mean flux at given
redshift, even if the intrinsic luminosity distribution is symmetric. This means that
particular care needs to be taken in the analysis of future large SN surveys. However,
if SNe Ia are quasi standard candles also at high redshifts, with an intrinsic
scatter of $\Delta L = 4\pi D_{\rm lum}^2(z)\,\Delta S(z)$ around the mean luminosity
$L_0 = 4\pi D_{\rm lum}^2(z)\,S_0(z)$, then it is possible to obtain volume-limited samples (in
contrast to flux-limited samples) of them.

If, for a given redshift, the sensitivity limit is chosen to be
$S_{\rm min} < \mu_{\rm min}\,(S_0 - 3\Delta S)$, one can be sure to find all SNe Ia at the redshift
considered. Here, $\mu_{\rm min}$ is the minimum magnification of a source at the considered
redshift. Since no source can be more de-magnified than one that is placed behind a
hypothetical empty cone (see Dyer & Roeder 1973 and the discussion in Sect. 4.5 of
Schneider et al. 1992), $\mu_{\rm min}$ is not much smaller than unity. Flux conservation
(e.g. Weinberg 1976) implies that the mean magnification of all sources at given
redshift is unity, $\langle\mu(z)\rangle = 1$, and so the expectation value of the observed flux at
given redshift is the unlensed flux, $\langle S(z)\rangle = S_0(z)$. It should be pointed out here
that a similar relation for the magnitudes does not hold, since magnitude is a
logarithmic measure of the flux, and so $\langle m(z)\rangle \ne m_0(z)$. This led to some confusing
conclusions in the literature claiming that lensing introduces a bias in cosmological
parameter estimates from supernovae, but this is not true: one just has to work in
terms of fluxes rather than magnitudes.
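The difference between averaging fluxes and averaging magnitudes can be made explicit with a toy example. The magnification distribution below is hypothetical: nine out of ten sources are slightly de-magnified and one is strongly magnified, chosen so that the mean magnification is exactly unity as required by flux conservation.

```python
import math

def mean(values):
    return sum(values) / len(values)

def lensing_bias(magnifications):
    """Return (<mu> - 1, <delta m>) for a list of equally likely magnifications."""
    mean_flux_shift = mean(magnifications) - 1.0
    mean_mag_shift = mean([-2.5 * math.log10(mu) for mu in magnifications])
    return mean_flux_shift, mean_mag_shift

# Hypothetical skewed distribution with <mu> = 1.
toy_magnifications = [0.95] * 9 + [1.45]

flux_shift, mag_shift = lensing_bias(toy_magnifications)
print(f"mean flux shift      = {flux_shift:+.2e}")
print(f"mean magnitude shift = {mag_shift:+.4f} mag")
```

The mean flux is unchanged, whereas the mean magnitude is shifted towards fainter values because the logarithm is a concave function. An analysis in terms of fluxes avoids this apparent bias.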

However, a broad magnification probability distribution enlarges the confidence
contours for $\Omega_0$ and $\Omega_\Lambda$ (e.g. Holz 1998). If the probability distribution were
known, more sensitive estimators of the cosmological model than the mean flux
at given redshift could be constructed. Furthermore, if the intrinsic luminosity
distribution of the SNe were known, the normalisation of the power spectrum
as a function of $\Omega_0$ and $\Omega_\Lambda$ could be inferred from the broadened
observed flux distribution (Metcalf 1999). If the dark matter is in the form of
compact objects with mass $\gtrsim 10^{-2}\,M_\odot$, these objects can individually magnify a SN
(Schneider & Wagoner 1987), additionally broadening the magnification probability
distribution and thus enabling the nature of dark matter to be tested through SN
observations (Metcalf & Silk 1999, Seljak & Holz 1999).

6.6.5 Shear in Apertures

We mentioned below eq. (6.33) that measures of cosmic magnification or
shear other than the angular auto-correlation function which filter the
effective-convergence power spectrum $P_\kappa$ with a function narrower than the Bessel
function $J_0(x)$ would be desirable. In practice, a convenient measure would be the
variance of the effective convergence within a circular aperture of radius $\theta$. Within
such an aperture, the averaged effective convergence and shear are

$$\kappa_{\rm av}(\theta) = \int_0^\theta\frac{d^2\phi}{\pi\theta^2}\,\kappa_{\rm eff}(\vec\phi), \qquad \gamma_{\rm av}(\theta) = \int_0^\theta\frac{d^2\phi}{\pi\theta^2}\,\gamma(\vec\phi), \tag{6.40}$$

and their variance is

$$\langle\kappa_{\rm av}^2\rangle(\theta) = \int_0^\theta\frac{d^2\phi}{\pi\theta^2}\int_0^\theta\frac{d^2\phi'}{\pi\theta^2}\,\langle\kappa_{\rm eff}(\vec\phi)\,\kappa_{\rm eff}(\vec\phi')\rangle = \langle|\gamma_{\rm av}|^2\rangle(\theta). \tag{6.41}$$

The remaining average is the effective-convergence auto-correlation function
$\xi_\kappa(|\vec\phi - \vec\phi'|)$, which can be expressed in terms of the power spectrum $P_\kappa$. The final
equality follows from $\xi_\kappa = \xi_\gamma$. Inserting this into (6.41) and performing the angular
integrals yields



$$\langle\kappa_{\rm av}^2\rangle(\theta) = 2\pi\int_0^\infty l\,dl\,P_\kappa(l)\left[\frac{J_1(l\theta)}{\pi l\theta}\right]^2 = \langle|\gamma_{\rm av}|^2\rangle(\theta), \tag{6.42}$$

where $J_1(x)$ is the first-order Bessel function of the first kind. Results for the rms
shear in apertures of varying size are shown in Fig. 19 (cf. Blandford et al. 1991).

Fig. 19. The rms shear $\gamma_{\rm rms}(\theta)$ in circular apertures of radius $\theta$ is plotted as a
function of $\theta$ for the four different realisations of the CDM cosmogony detailed in Tab. 1,
where all sources are assumed to be at redshift $z_{\rm s} = 1$. A pair of curves is plotted for each
realisation, where for each pair the curve with lower amplitude at small $\theta$ is for linearly,
the other one for non-linearly evolving density fluctuations. Solid curves: SCDM; dotted
curves: $\sigma$CDM; short-dashed curves: OCDM; and long-dashed curves: $\Lambda$CDM. For the
cluster-normalised models, typical rms shear values are $\approx 3\%$ for $\theta \approx 1'$. Non-linear
evolution increases the amplitude by about a factor of two at $\theta \approx 1'$ over linear evolution.

6.6.6 Aperture Mass



Another measure for the effects of weak lensing, the aperture mass $M_{\rm ap}(\theta)$
(cf. Sect. 5.3.1), was introduced for cosmic shear by Schneider et al. (1998a) as

$$M_{\rm ap}(\theta) = \int_0^\theta d^2\phi\,U(\phi)\,\kappa_{\rm eff}(\vec\phi), \tag{6.43}$$

where the weight function $U(\phi)$ satisfies the criterion

$$\int_0^\theta\phi\,d\phi\,U(\phi) = 0. \tag{6.44}$$

In other words, $U(\phi)$ is taken to be a compensated radial weight function across the
aperture. For such weight functions, the aperture mass can be expressed in terms of
the tangential component of the observable shear relative to the aperture centre,

$$M_{\rm ap}(\theta) = \int_0^\theta d^2\phi\,Q(\phi)\,\gamma_{\rm t}(\vec\phi), \tag{6.45}$$

where $Q(\phi)$ is related to $U(\phi)$ by (5.23). $M_{\rm ap}$ is a scalar quantity directly measurable
in terms of the shear. The variance of $M_{\rm ap}$ reads

$$\langle M_{\rm ap}^2\rangle(\theta) = 2\pi\int_0^\infty l\,dl\,P_\kappa(l)\left[\int_0^\theta\phi\,d\phi\,U(\phi)\,J_0(l\phi)\right]^2. \tag{6.46}$$

Equations (6.42) and (6.46) provide alternative observable quantities which are
related to the effective-convergence power spectrum $P_\kappa$ through narrower filters than
the auto-correlation function $\xi_\kappa$. The $M_{\rm ap}$ statistic in particular permits one to tune
the filter function through different choices of $U(\phi)$ within the constraint (6.44).
It is important that $M_{\rm ap}$ can also be expressed in terms of the shear [see eq. (5.26,
page 98)], so that $M_{\rm ap}$ can directly be obtained from the observed galaxy
ellipticities.

Schneider et al. (1998a) suggested a family of radial filter functions $U(\phi)$, the
simplest of which is

$$U(\phi) = \frac{9}{\pi\theta^2}\,(1 - x^2)\left(\frac{1}{3} - x^2\right), \qquad Q(\phi) = \frac{6}{\pi\theta^2}\,x^2(1 - x^2), \tag{6.47}$$

where $x\theta = \phi$. With this choice, the variance $\langle M_{\rm ap}^2\rangle(\theta)$ becomes

$$\langle M_{\rm ap}^2\rangle(\theta) = 2\pi\int_0^\infty l\,dl\,P_\kappa(l)\,J^2(l\theta), \tag{6.48}$$

with the filter function

$$J(\eta) = \frac{12}{\pi\eta^2}\,J_4(\eta), \tag{6.49}$$

where $J_4(\eta)$ is the fourth-order Bessel function of the first kind. Examples for the
rms aperture mass, $M_{\rm ap,rms}(\theta) = \langle M_{\rm ap}^2\rangle^{1/2}(\theta)$, are shown in Fig. 20.

Fig. 20. The rms aperture mass, $M_{\rm ap,rms}(\theta)$, is shown as a function of aperture radius
$\theta$ for the four different realisations of the CDM cosmogony detailed in Tab. 1, where all
sources are assumed to be at redshift $z_{\rm s} = 1$. For each realisation, a pair of curves is
plotted; one curve with lower amplitude for linear, and the second curve for non-linear
density evolution. Solid curves: SCDM; dotted curves: $\sigma$CDM; short-dashed curves: OCDM;
and long-dashed curves: $\Lambda$CDM. Non-linear evolution has a pronounced effect: the amplitude
is approximately doubled, and the peak shifts from degree to arc-minute scales.

The curves look substantially different from those shown in Figs. 17 and 19. Unlike
there, the aperture mass does not increase monotonically as $\theta \to 0$, but reaches a
maximum at finite $\theta$ and drops for smaller angles. When non-linear evolution of the
density fluctuations is assumed, the maximum occurs at much smaller $\theta$ than for
linear evolution: linear evolution predicts the peak at angles of order one degree,
non-linear evolution around $1'$. The amplitude of $M_{\rm ap,rms}(\theta)$ reaches $\approx 1\%$ for
cluster-normalised models, quite independent of the cosmological parameters.
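Two properties of the polynomial filter in eq. (6.47) can be checked numerically: that it is compensated in the sense of eq. (6.44), and that its Bessel transform appearing in eq. (6.46) equals the closed form (6.49). The sketch below does both for unit aperture radius, $\theta = 1$; the quadrature helpers are ours and rely only on the Python standard library.

```python
import math

def bessel_j(n, x, steps=200):
    """J_n(x) from Bessel's integral (1/pi) int_0^pi cos(n t - x sin t) dt (trapezoid rule)."""
    h = math.pi / steps
    total = 0.5 * (1.0 + math.cos(n * math.pi))   # end points t = 0 and t = pi
    for i in range(1, steps):
        t = i * h
        total += math.cos(n * t - x * math.sin(t))
    return total * h / math.pi

def u_filter(x):
    """U(phi) of eq. (6.47) for theta = 1, with x = phi / theta."""
    return 9.0 / math.pi * (1.0 - x * x) * (1.0 / 3.0 - x * x)

def simpson(f, a, b, n=400):
    """Composite Simpson rule with an even number n of intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * f(a + i * h)
    return total * h / 3.0

def compensation_integral():
    """Left-hand side of eq. (6.44); should vanish."""
    return simpson(lambda x: x * u_filter(x), 0.0, 1.0)

def filter_numeric(eta):
    """Inner integral of eq. (6.46): int_0^1 x U(x) J_0(eta x) dx."""
    return simpson(lambda x: x * u_filter(x) * bessel_j(0, eta * x), 0.0, 1.0)

def filter_closed_form(eta):
    """J(eta) = 12 J_4(eta) / (pi eta^2), eq. (6.49)."""
    return 12.0 * bessel_j(4, eta) / (math.pi * eta * eta)

print(f"compensation integral = {compensation_integral():.2e}")
for eta in (1.0, 4.11, 10.0):
    print(f"eta = {eta:5.2f}   numeric = {filter_numeric(eta):+.6f}   "
          f"closed form = {filter_closed_form(eta):+.6f}")
```

Because $U$ is compensated, the filter vanishes for $\eta \to 0$, so that long-wavelength modes do not contribute to $M_{\rm ap}$.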

Some insight into the expected amplitude and shape of $\langle M_{\rm ap}^2\rangle(\theta)$ can be gained by
noting that $J^2(\eta)$ is well approximated by a Gaussian,

$$J^2(\eta) \approx A\,\exp\left[-\frac{(\eta - \eta_0)^2}{2\sigma^2}\right], \tag{6.50}$$

with mean $\eta_0 \approx 4.11$, amplitude $A \approx 4.52 \times 10^{-3}$, and width $\sigma \approx 1.24$. At aperture
radii of $\approx 1'$, the peak $\eta_0 \approx 4.11$ corresponds to angular scales of $2\pi l^{-1} \approx 1.6'$,
where the total power $l^2P_\kappa(l)$ in the effective convergence is close to its broad
maximum (cf. Fig. 16). The filter function $J^2(\eta)$ is fairly narrow: its relative
width corresponds to an $l$ range of $\delta l/l \approx \sigma/\eta_0 \approx 0.3$. Thus, the contributing range
of modes $l$ in the integral (6.48) is very small. Crudely approximating the Gaussian
by a delta distribution,

$$J^2(\eta) \approx A\,\sqrt{2\pi}\,\sigma\,\delta_{\rm D}(\eta - \eta_0), \tag{6.51}$$

we are led to

$$\langle M_{\rm ap}^2\rangle \approx (2\pi)^{3/2}\,A\,\frac{\sigma}{\eta_0}\,l_0^2\,P_\kappa(l_0) \approx 2.15 \times 10^{-2}\,l_0^2\,P_\kappa(l_0), \tag{6.52}$$

with $l_0 \equiv \eta_0\theta^{-1}$. Hence, the mean-square aperture mass is expected to directly
yield the total power in the effective-convergence power spectrum, scaled down by
a factor of $2.15 \times 10^{-2}$. We saw in Fig. 16 that $l^2P_\kappa(l) \approx 3 \times 10^{-3}$ for
$2\pi l^{-1} \approx 1'$ in cluster-normalised CDM models, so that

$$\langle M_{\rm ap}^2\rangle^{1/2} \approx 0.8\% \quad\text{at}\quad \theta \approx 1' \tag{6.53}$$

for sources at redshift unity. We compare $M_{\rm ap,rms}(\theta)$ and the approximation
$\tilde M_{\rm ap,rms}(\theta)$ in Fig. 21. Obviously, the approximation is excellent for $\theta \gtrsim 10'$, but
even for smaller aperture radii of $\approx 1'$ the relative deviation is less than $\approx 5\%$. At
this point, the prime virtue of the narrow filter function $J(\eta)$ shows up most
prominently. Up to relatively small errors of a few per cent, the rms aperture mass very
accurately reflects the effective-convergence power spectrum $P_\kappa(l)$. Observations
of $M_{\rm ap,rms}(\theta)$ are therefore most suitable to obtain information on the matter power
spectrum (cf. Bartelmann & Schneider 1999).
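The numbers entering the delta-function estimate can be reproduced directly. The sketch below locates the peak of $J^2(\eta)$ by a simple grid scan, evaluates the prefactor $(2\pi)^{3/2}A\sigma/\eta_0$ from the Gaussian parameters quoted above, and converts a total power of $3 \times 10^{-3}$ into an rms aperture mass. The Bessel function is again computed from Bessel's integral, and all helper names are ours.

```python
import math

def bessel_j(n, x, steps=100):
    """J_n(x) from Bessel's integral (1/pi) int_0^pi cos(n t - x sin t) dt (trapezoid rule)."""
    h = math.pi / steps
    total = 0.5 * (1.0 + math.cos(n * math.pi))   # end points t = 0 and t = pi
    for i in range(1, steps):
        t = i * h
        total += math.cos(n * t - x * math.sin(t))
    return total * h / math.pi

def filter_squared(eta):
    """J^2(eta) with J(eta) = 12 J_4(eta) / (pi eta^2)."""
    return (12.0 * bessel_j(4, eta) / (math.pi * eta * eta)) ** 2

def locate_peak(lo=2.0, hi=7.0, n=2500):
    """Position and height of the maximum of J^2 on a uniform grid."""
    step = (hi - lo) / n
    grid = [lo + i * step for i in range(n + 1)]
    eta_peak = max(grid, key=filter_squared)
    return eta_peak, filter_squared(eta_peak)

# Gaussian-fit parameters quoted in the text.
ETA_0, AMPLITUDE, SIGMA = 4.11, 4.52e-3, 1.24

def delta_prefactor():
    """(2 pi)^(3/2) A sigma / eta_0, the numerical factor in eq. (6.52)."""
    return (2.0 * math.pi) ** 1.5 * AMPLITUDE * SIGMA / ETA_0

def rms_aperture_mass(total_power):
    """<M_ap^2>^(1/2) from the total power l_0^2 P_kappa(l_0)."""
    return math.sqrt(delta_prefactor() * total_power)

eta_peak, height = locate_peak()
print(f"peak of J^2 at eta = {eta_peak:.2f}, height = {height:.2e}")
print(f"prefactor          = {delta_prefactor():.3e}")
print(f"rms aperture mass  = {100.0 * rms_aperture_mass(3.0e-3):.2f} %")
```

The maximum of $J^2$ lies very close to the fitted mean $\eta_0$, and the resulting rms aperture mass agrees with the estimate (6.53).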



6.6.7 Power Spectrum and Filter Functions

The three statistical measures discussed above, the magnification (or, equivalently,
the shear) auto-correlation function $\xi_\mu$, the mean-square shear in apertures
$\langle\gamma^2\rangle$, and the mean-square aperture mass $\langle M_{\rm ap}^2\rangle$, are related to the
effective-convergence power spectrum $P_\kappa$ in very similar ways. According to eqs. (6.33),
(6.42), and (6.48), they can all be written in the form

$$Q(\theta) = 2\pi\int_0^\infty l\,dl\,P_\kappa(l)\,F(l\theta), \tag{6.54}$$

where the filter functions $F(\eta)$ are given by

$$F(\eta) = \begin{cases}
\dfrac{J_0(\eta)}{\pi^2} & \text{for } Q = \xi_\mu, \\[2ex]
\left[\dfrac{J_1(\eta)}{\pi\eta}\right]^2 & \text{for } Q = \langle\gamma^2\rangle, \\[2ex]
\left[\dfrac{12\,J_4(\eta)}{\pi\eta^2}\right]^2 & \text{for } Q = \langle M_{\rm ap}^2\rangle.
\end{cases} \tag{6.55}$$

Fig. 21. The rms aperture mass $M_{\rm ap,rms}(\theta)$ is shown together with the approximation
$\tilde M_{\rm ap,rms}(\theta)$ of eq. (6.52). The three curves correspond to the three cluster-normalised
cosmological models (SCDM, OCDM and $\Lambda$CDM) introduced in Tab. 1 for non-linearly evolving
matter perturbations. All sources were assumed to be at redshift $z_{\rm s} = 1$. Clearly, the rms
aperture mass is very accurately approximated by $\tilde M_{\rm ap,rms}$ on angular scales $\gtrsim 10'$, and
even for smaller aperture sizes of order $\approx 1'$ the deviation between the curves is smaller
than $\approx 5\%$. The observable rms aperture mass therefore provides a very direct measure for
the effective-convergence power spectrum $P_\kappa(l)$.
Figure 22 shows these three filter functions as functions of $\eta = l\theta$. Firstly, the
curves illustrate that the amplitude of $\xi_\mu$ is largest (owing to the factor of four
relative to the definition of $\xi_\gamma$), and that of $\langle M_{\rm ap}^2\rangle$ is smallest because the
amplitudes of the filter functions themselves decrease. Secondly, it becomes evident that,
for given $\theta$, the range of $l$ modes of the effective-convergence power spectrum $P_\kappa(l)$
convolved into the weak-lensing estimator is largest for $\xi_\mu$ and smallest for
$\langle M_{\rm ap}^2\rangle$. Thirdly, the envelope of the filter functions for large $\eta$ decreases most
slowly for $\xi_\mu$ and most rapidly for $\langle M_{\rm ap}^2\rangle$. Although the aperture mass has the
smallest signal amplitude, it is a much better probe for the effective-convergence power
spectrum $P_\kappa(l)$ than the other measures because it picks up the smallest range of $l$ modes
and most strongly suppresses the $l$ modes smaller or larger than its peak location.

Fig. 22. The three filter functions $F(\eta)$ defined in eq. (6.55) are shown as functions of
$\eta = l\theta$. They occur in the expressions for the magnification auto-correlation function,
$\xi_\mu$ (solid curve), the mean-square shear in apertures, $\langle\gamma^2\rangle$ (dotted curve), and the
mean-square aperture mass, $\langle M_{\rm ap}^2\rangle$ (dashed curve).

We can therefore conclude that, while the strongest weak-lensing signal is picked
up by the magnification auto-correlation function $\xi_\mu$, the aperture mass is the
weak-lensing estimator most suitable for extracting information on the
effective-convergence power spectrum.



6.6.8 Signal-to-Noise Estimate of Aperture-Mass Measurements

The question then arises whether the aperture mass can be measured with sufficient
significance in upcoming wide-field imaging surveys. In practice, $M_{\rm ap}$ is derived
from observations of image distortions of faint background galaxies, using
eq. (5.26, page 98) and replacing the integral by a sum over galaxy ellipticities.
If we consider $N_{\rm ap}$ independent apertures with $N_i$ galaxies in the $i$-th aperture, an
unbiased estimator of $\langle M_{\rm ap}^2\rangle$ is

$$\mathcal{M} = \frac{(\pi\theta^2)^2}{N_{\rm ap}}\sum_{i=1}^{N_{\rm ap}}\frac{1}{N_i(N_i - 1)}\sum_{j \ne k}^{N_i} Q_{ij}\,Q_{ik}\,\epsilon_{{\rm t},ij}\,\epsilon_{{\rm t},ik}, \tag{6.56}$$

where $Q_{ij}$ is the value of the weight function at the position of the $j$-th galaxy in
the $i$-th aperture, and $\epsilon_{{\rm t},ij}$ is defined accordingly.

The noise properties of this estimator were investigated in Schneider et al. (1998a).
One source of noise comes from the fact that galaxies are not intrinsically circular,
but rather have an intrinsic ellipticity distribution. A second contribution to the
noise is due to the random galaxy positions, and a third one to cosmic (or sampling)
variance. Under the assumption that the number of galaxies $N_i$ in the apertures is
large, $N_i \gg 1$, it turns out that the second of these contributions can be neglected
compared to the other two. For this case, and assuming for simplicity that all $N_i$ are
equal, $N_i \equiv N$, the signal-to-noise of the estimator becomes

$$\frac{\rm S}{\rm N} \equiv \frac{\langle M_{\rm ap}^2\rangle}{\sigma(\mathcal{M})} = N_{\rm ap}^{1/2}\left[\mu_4 + \left(\sqrt{2} + \frac{6\sigma_\epsilon^2}{5\sqrt{2}\,N\,\langle M_{\rm ap}^2\rangle}\right)^2\right]^{-1/2}, \tag{6.57}$$

where $\sigma_\epsilon \approx 0.2$ (e.g. Hudson et al. 1998) is the dispersion of the intrinsic galaxy
ellipticities, and $\mu_4 = \langle M_{\rm ap}^4\rangle/\langle M_{\rm ap}^2\rangle^2 - 3$ is the kurtosis of $M_{\rm ap}$, which
vanishes for a Gaussian distribution. The two terms of (6.57) in parentheses represent
the noise contributions from Gaussian sampling variance and the intrinsic ellipticity
distribution, respectively, and $\mu_4$ accounts for sampling variance in excess of that for
a Gaussian distribution. On angular scales of a few arc minutes and smaller, the
intrinsic ellipticities dominate the noise, while the cosmic variance dominates on
larger scales.
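To get a feeling for the numbers, the signal-to-noise expression can be evaluated for a concrete survey. The sketch below assumes the form of eq. (6.57) with the shape-noise term $6\sigma_\epsilon^2/(5\sqrt{2}\,N\langle M_{\rm ap}^2\rangle)$ as we read it from Schneider et al. (1998a), a $5^\circ \times 5^\circ$ field densely covered by apertures, 30 galaxies per square arc minute, $\sigma_\epsilon = 0.2$, and an rms aperture mass of 0.8% at $\theta = 1'$ as in eq. (6.53). The function names and the sample values of $\mu_4$ are ours.

```python
import math

def signal_to_noise(map_sq, n_gal, n_ap, sigma_eps=0.2, mu4=0.0):
    """S/N of the <M_ap^2> estimator for n_ap apertures with n_gal galaxies each."""
    shape_noise = 6.0 * sigma_eps ** 2 / (5.0 * math.sqrt(2.0) * n_gal * map_sq)
    return math.sqrt(n_ap / (mu4 + (math.sqrt(2.0) + shape_noise) ** 2))

def survey_setup(theta_arcmin, field_arcmin=300.0, density_per_arcmin2=30.0):
    """Number of apertures and galaxies per aperture for a densely covered square field."""
    n_ap = (field_arcmin / (2.0 * theta_arcmin)) ** 2
    n_gal = density_per_arcmin2 * math.pi * theta_arcmin ** 2
    return n_ap, n_gal

n_ap, n_gal = survey_setup(1.0)
map_sq = 0.008 ** 2    # <M_ap^2> for an rms aperture mass of 0.8 per cent
for mu4 in (0.0, 1.0, 5.0):   # illustrative kurtosis values
    print(f"mu_4 = {mu4:.0f}   S/N = {signal_to_noise(map_sq, n_gal, n_ap, mu4=mu4):.1f}")
```

At $\theta = 1'$ the shape-noise term is several times larger than $\sqrt{2}$, so the intrinsic ellipticities dominate the noise and a kurtosis of order unity changes the result only slightly.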



Another convenient and useful property of the aperture mass $M_{\rm ap}$ follows from
its filter function being narrow, namely that $M_{\rm ap}$ is a well localised measure
of cosmic weak lensing. This implies that $M_{\rm ap}$ measurements in neighbouring
apertures are almost uncorrelated even if the aperture centres are very close
(Schneider et al. 1998a). It is therefore possible to gain a large number of (almost)
independent $M_{\rm ap}$ measurements from a single large data field by covering the field
densely with apertures. This is a significant advantage over the other two measures
for weak lensing discussed above, whose broad filter functions introduce considerable
correlation between neighbouring measurements, implying that for their measurement
imaging data on widely separated fields are needed to ensure statistical
independence. Therefore, a meaningful strategy to measure cosmic shear consists
in taking a large data field, covering it densely with apertures of varying radius
$\theta$, and determining $\langle M_{\rm ap}^2\rangle$ in them via the ellipticities of galaxy images. Figure 23
shows an example for the signal-to-noise ratio of such a measurement that can be
expected as a function of aperture radius $\theta$.

Fig. 23. The signal-to-noise ratio S/N($\theta$) of measurements of mean-square aperture masses
$\langle M_{\rm ap}^2\rangle$ is plotted as a function of aperture radius $\theta$ for an experimental setup as
described in the text. The kurtosis was set to zero here. The four curves are for the four
different realisations of the CDM cosmogony listed in Tab. 1. Solid curve: SCDM; dotted
curve: $\sigma$CDM; short-dashed curve: OCDM; and long-dashed curve: $\Lambda$CDM. Quite independently
of the cosmological parameters, the signal-to-noise ratio S/N reaches values of $> 10$ on
scales of $1'$ to $2'$.
Computing the curves in Fig. 23, we assumed that a data field of size
$5^\circ \times 5^\circ$ is available which is densely covered by apertures of radius
$\theta$, hence the number of (almost) independent apertures is
$N_\mathrm{ap} = (300'/2\theta)^2$. The number density of galaxies was taken as
$30\,\mathrm{arcmin}^{-2}$, and the intrinsic ellipticity dispersion was assumed to
be $\sigma_\epsilon = 0.2$. Evidently, high signal-to-noise ratios of $> 10$ are reached
on angular scales of about $1'$ in cluster-normalised universes, quite independently of
the cosmological parameters. The decline of S/N for large $\theta$ is due to the
decreasing number of independent apertures on the data field, whereas the decline for
small $\theta$ is due to the decrease of the signal $\langle M_\mathrm{ap}^2\rangle$, as
seen in Fig. 20. We also note that for calculating the curves in Fig. 23, we have put
$\mu_4 = 0$. This is likely to be an overly optimistic assumption for small angular
scales where the density field is highly non-linear. Unfortunately, $\mu_4$ cannot easily
be estimated analytically. It was numerically derived from ray-tracing through $N$-body
simulations of large-scale matter distributions by Reblinsky et al. (1999). The kurtosis
exceeds unity even on scales as large as $10'$, demonstrating the highly non-Gaussian
nature of the non-linearly developed density perturbations.
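
The survey arithmetic behind these numbers is easy to reproduce. The following sketch (the function names are illustrative and not taken from the text) evaluates the number of (almost) independent apertures, $(300'/2\theta)^2$, and the mean number of galaxies per aperture for the field size and galaxy density assumed above.

```python
import math

FIELD_SIDE_ARCMIN = 5.0 * 60.0   # 5 deg x 5 deg field, side length in arcmin
GALAXY_DENSITY = 30.0            # galaxies per arcmin^2, as assumed in the text


def n_apertures(theta_arcmin: float) -> float:
    """Number of apertures of radius theta covering the field, (300'/2 theta)^2."""
    return (FIELD_SIDE_ARCMIN / (2.0 * theta_arcmin)) ** 2


def galaxies_per_aperture(theta_arcmin: float) -> float:
    """Mean number of galaxies inside one circular aperture of radius theta."""
    return GALAXY_DENSITY * math.pi * theta_arcmin ** 2


if __name__ == "__main__":
    for theta in (1.0, 2.0, 5.0, 10.0):
        print(f"theta = {theta:4.1f}'  N_ap = {n_apertures(theta):8.0f}  "
              f"galaxies per aperture = {galaxies_per_aperture(theta):8.0f}")
```

Going from $1'$ to $10'$ reduces the number of apertures by a factor of 100, which is the origin of the decline of S/N at large $\theta$.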
Although the aperture mass is a very convenient measure of cosmic shear and provides
a localised estimate of the projected power spectrum $P_\kappa(l)$ [see (6.52)], it is
by no means clear that it is an optimal measure for the projected power spectrum.
Kaiser (1998) considered the case of a square-shaped data field and employed the
Fourier-transformed Kaiser & Squires inversion formula, eq. (5.3, page 88). The
Fourier transform of the shear is then replaced by a sum over galaxy ellipticities
$\epsilon_i$, so that $\hat\kappa_\mathrm{eff}(\vec l)$ is expressed directly in terms of
the $\epsilon_i$. The square $|\hat\kappa_\mathrm{eff}(\vec l)|^2$ yields an
estimate for the power spectrum which allows a simple determination of the noise
coming from the intrinsic ellipticity distribution. Kaiser (1998) pointed out that,
while this noise is very small for angular scales much smaller than the size of the
data field, the sampling variance is much larger, so that different sampling
strategies should be explored. For example, he suggested using a sparse sampling strategy.
Seljak (1998) developed an estimator for the power spectrum which achieves minimum
variance in the case of a Gaussian field. Since the power spectrum $P_\kappa(l)$
deviates significantly from its linear prediction on angular scales below one degree, one
expects that the field attains significant non-Gaussian features on smaller angular
scales, so that this estimator need no longer have minimum variance.



6.7 Higher-Order Statistical Measures



6.7.1 The Skewness 

As the density perturbation field $\delta$ grows with time, it develops non-Gaussian
features. In particular, $\delta$ is bounded by $-1$ from below and unbounded from above,
and therefore the distribution of $\delta$ is progressively skewed while evolution proceeds.
The same then applies to quantities like the effective convergence $\kappa_\mathrm{eff}$
derived from $\delta$ (cf. Jain & Seljak 1997; Bernardeau et al. 1997; Schneider et al.
1998a). Skewness of the effective convergence can be quantified by means of the
three-point correlator of $\kappa_\mathrm{eff}$. In order to compute that, we use
expression (6.18), Fourier transform it, and also express the density contrast $\delta$
in terms of its Fourier transform. Additionally, we employ the same approximation used
in deriving Limber's equation in Fourier space, namely that correlations of the density
contrast along the line-of-sight are negligibly small. After carrying out this lengthy
but straightforward procedure, the three-point correlator of the Fourier transform of
$\kappa_\mathrm{eff}$ reads (suppressing the subscript 'eff' for brevity)



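$$
\langle\hat\kappa(\vec l_1)\hat\kappa(\vec l_2)\hat\kappa(\vec l_3)\rangle
= \frac{27 H_0^6\,\Omega_0^3}{8c^6}
\int_0^{w_\mathrm{H}} \mathrm{d}w\,\frac{\bar W^3(w)}{a^3(w)\,f_K^3(w)}
\int_{-\infty}^{\infty}\frac{\mathrm{d}k_3}{2\pi}\,\exp(\mathrm{i}k_3 w)
\times \left\langle
\hat\delta\!\left(\frac{\vec l_1}{f_K(w)},k_3\right)
\hat\delta\!\left(\frac{\vec l_2}{f_K(w)},0\right)
\hat\delta\!\left(\frac{\vec l_3}{f_K(w)},0\right)
\right\rangle .
\tag{6.58}
$$

Hats on symbols denote Fourier transforms. Note the fairly close analogy between
(6.58) and (6.22): The three-point correlator of $\hat\kappa$ is a distance-weighted
integral over the three-point correlator of the Fourier-transformed density contrast
$\hat\delta$. The fact that the third component $k_3$ of the wave vector $\vec k$ appears
only in the first factor $\hat\delta$ reflects the approximation mentioned above, i.e.
that correlations of $\delta$ along the line-of-sight are negligible.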

Suppose now that the density contrast $\delta$ is expanded in a perturbation series,
$\delta = \sum_i \delta^{(i)}$ such that $\delta^{(i)} = \mathcal{O}([\delta^{(1)}]^i)$,
and truncated after the second order. The three-point correlator of $\delta^{(1)}$
vanishes because $\delta$ remains Gaussian to first perturbation order. The lowest-order,
non-vanishing three-point correlator of $\delta$ can therefore symbolically be written
$\langle\delta^{(1)}\delta^{(1)}\delta^{(2)}\rangle$, plus two permutations of that
expression. The second-order density perturbation is related to the first order through
(Fry 1984; Goroff et al. 1986; Bouchet et al. 1992)

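$$
\hat\delta^{(2)}(\vec k,w) = D_+^2(w)\int\frac{\mathrm{d}^3k'}{(2\pi)^3}\,
\hat\delta_0^{(1)}(\vec k')\,\hat\delta_0^{(1)}(\vec k-\vec k')\,
F(\vec k',\vec k-\vec k') ,
\tag{6.59}
$$

where $\hat\delta_0^{(1)}$ is the first-order density perturbation linearly extrapolated
to the present epoch, and $D_+(w)$ is the linear growth factor,
$D_+(w) = a(w)\,g[a(w)]$ with $g(a)$ defined in eq. (2.52) on page 26. The function
$F(\vec x,\vec y)$ is given by

$$
F(\vec x,\vec y) = \frac{5}{7}
+ \frac{1}{2}\left(\frac{1}{|\vec x|^2}+\frac{1}{|\vec y|^2}\right)\vec x\cdot\vec y
+ \frac{2}{7}\,\frac{(\vec x\cdot\vec y)^2}{|\vec x|^2\,|\vec y|^2} .
\tag{6.60}
$$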
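
A few limiting cases make the behaviour of this coupling function concrete. The sketch below assumes the form $F = 5/7 + \frac{1}{2}(|\vec x|^{-2} + |\vec y|^{-2})\,\vec x\cdot\vec y + \frac{2}{7}(\vec x\cdot\vec y)^2/(|\vec x|^2|\vec y|^2)$; the function name `f_kernel` is illustrative. It shows that $F$ is symmetric in its arguments, equals 2 for equal parallel vectors, $5/7$ for perpendicular vectors, and vanishes for $\vec y = -\vec x$.

```python
import numpy as np


def f_kernel(x, y):
    """Second-order coupling F(x, y) for two non-zero wave vectors x and y."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    x2 = float(np.dot(x, x))   # |x|^2
    y2 = float(np.dot(y, y))   # |y|^2
    xy = float(np.dot(x, y))   # scalar product x.y
    return 5.0 / 7.0 + 0.5 * (1.0 / x2 + 1.0 / y2) * xy + (2.0 / 7.0) * xy**2 / (x2 * y2)


if __name__ == "__main__":
    k = np.array([0.3, 0.0, 0.0])
    print("parallel, equal length:     ", f_kernel(k, k))
    print("antiparallel, equal length: ", f_kernel(k, -k))
    print("perpendicular:              ", f_kernel(k, [0.0, 0.7, 0.0]))
```

The vanishing of $F(\vec x,-\vec x)$ means that the second-order term does not change the mean density contrast.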

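Relation (6.59) implies that the lowest-order three-point correlator
$\langle\delta^{(1)}\delta^{(1)}\delta^{(2)}\rangle$ involves four-point correlators of
$\delta^{(1)}$. For Gaussian fields like $\delta^{(1)}$, four-point correlators can be
decomposed into sums of products of two-point correlators, which can be expressed in
terms of the linearly extrapolated density power spectrum $P_\delta^{(0)}$. This leads to

$$
\langle\hat\delta^{(1)}(\vec k_1)\,\hat\delta^{(1)}(\vec k_2)\,\hat\delta^{(2)}(\vec k_3)\rangle
= 2\,(2\pi)^3\,D_+^4(w)\,P_\delta^{(0)}(k_1)\,P_\delta^{(0)}(k_2)
\times \delta_\mathrm{D}(\vec k_1+\vec k_2+\vec k_3)\,F(\vec k_1,\vec k_2) .
\tag{6.61}
$$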

The complete lowest-order three-point correlator of $\delta$ is a sum of three terms,
namely the left-hand side of (6.61) and two permutations thereof. Each permutation
yields the same result, so that the complete correlator is three times the right-hand
side of (6.61). We can now work our way back, inserting the three-point density
correlator into eq. (6.58) and Fourier-transforming the result with respect to
$\vec l_{1,2,3}$. The three-point correlator of the effective convergence so obtained
can then in a final step be used to compute the third moment of the aperture mass.
The result is (Schneider et al. 1998a)

$$
\langle M_\mathrm{ap}^3(\theta)\rangle
= \frac{81 H_0^6\,\Omega_0^3}{8\pi c^6}
\int_0^{w_\mathrm{H}} \mathrm{d}w\,\frac{\bar W^3(w)\,D_+^4(w)}{a^3(w)\,f_K(w)}
\int \mathrm{d}^2l_1\,P_\delta^{(0)}\!\left(\frac{l_1}{f_K(w)}\right) J(l_1\theta)
\times \int \mathrm{d}^2l_2\,P_\delta^{(0)}\!\left(\frac{l_2}{f_K(w)}\right) J(l_2\theta)\,
J(|\vec l_1+\vec l_2|\,\theta)\,F(\vec l_1,\vec l_2) ,
\tag{6.62}
$$



with the filter function $J(\eta)$ defined in eq. (6.49). Commonly, third-order moments
are expressed in terms of the skewness,

$$
S(\theta) \equiv \frac{\langle M_\mathrm{ap}^3(\theta)\rangle}
{\langle M_\mathrm{ap}^2(\theta)\rangle^2} ,
\tag{6.63}
$$

where $\langle M_\mathrm{ap}^2(\theta)\rangle$ is calculated with the linearly evolved
power spectrum. As seen earlier in eq. (6.48), $\langle M_\mathrm{ap}^2\rangle$ scales
with the amplitude of the power spectrum, while $\langle M_\mathrm{ap}^3\rangle$ scales
with the square of it. In this approximation, the skewness $S(\theta)$ is
therefore independent of the normalisation of the power spectrum, removing that
major uncertainty and leaving cosmological parameters as primary degrees of freedom.
For instance, the skewness $S(\theta)$ is expected to scale approximately with
$\Omega_0^{-1}$. Figure 24 shows three examples.

As expected, lower values of $\Omega_0$ yield larger skewness, and the skewness is
reduced when $\Omega_\Lambda$ is increased keeping $\Omega_0$ fixed. Despite the
sensitivity of $S(\theta)$ to the cosmological parameters, it should be noted that the
source redshift distribution [entering through $\bar W(w)$] needs to be known
sufficiently well before attempts can be made at constraining cosmological parameters
through measurements of the aperture-mass skewness. However, photometric redshift
estimates are expected to produce sufficiently well-constrained redshift distributions
in the near future (Connolly et al. 1995; Gwyn & Hartwick 1996; Hogg et al. 1998).

We have confined the discussion of the skewness to the aperture mass since
$M_\mathrm{ap}$ is a scalar measure of the cosmic shear which can directly be expressed
in terms of the observed image ellipticities. One can of course also consider the
skewness directly in terms of $\kappa$, since $\kappa$ can be obtained from the observed
image ellipticities through a mass reconstruction algorithm as described in Sect. 5.
Analytical and numerical results for this skewness have been presented in, e.g.,
Bernardeau et al. (1997), van Waerbeke et al. (1999), Jain et al. (1999) and
Reblinsky et al. (1999). We shall discuss some of their results in Sect. 6.9.1.



Fig. 24. The skewness $S(\theta)$ of the aperture mass $M_\mathrm{ap}(\theta)$ is shown as
a function of aperture radius $\theta$ for three of the realisations of the
cluster-normalised CDM cosmogony listed in Tab. 1: SCDM (solid curve); OCDM (dotted
curve); and $\Lambda$CDM (dashed curve). The source redshift was assumed to be
$z_\mathrm{s} = 1$.

6.7.2 Number density of (dark) haloes

In Sect. 5.3.1, we discussed the possibility of detecting mass concentrations by their
weak lensing effects on background galaxies by means of the aperture mass. The
number density of mass concentrations that can be detected at a given threshold
of $M_\mathrm{ap}$ depends on the cosmological model. Fixing the normalisation of the
power spectrum so that the local abundance of massive clusters is reproduced, the
evolution of the density field proceeds differently in different cosmologies, and so
the abundances will differ at redshifts $z \sim 0.3$ where the aperture-mass method is
most sensitive.

The number density of haloes above a given threshold of $M_\mathrm{ap}(\theta)$ can be
estimated analytically, using two ingredients. First, the spatial number density of
haloes at redshift $z$ with mass $M$ can be described by the Press-Schechter theory
(Press & Schechter 1974), which numerical simulations (Lacey & Cole 1993,
Lacey & Cole 1994) have shown to be a fairly accurate approximation. Second, in
a series of very large $N$-body simulations, Navarro et al. (1996, 1997) found that
dark matter haloes have a universal density profile which can be described by two
parameters, the halo mass and a characteristic scale length, which depends on the
cosmological model and the redshift. Combining these two results from cosmology,
Kruse & Schneider (1999b) calculated the number density of haloes exceeding
$M_\mathrm{ap}$. Using the signal-to-noise estimate eq. (6.57), a threshold value of
$M_\mathrm{ap}$ can be directly translated into a signal-to-noise threshold
$S_\mathrm{c}$. For an assumed number density of $n = 30\,\mathrm{arcmin}^{-2}$ and an
ellipticity dispersion $\sigma_\epsilon = 0.2$, one finds
$S_\mathrm{c} \approx (\theta/1\,\mathrm{arcmin})\,M_\mathrm{ap}(\theta)/0.016$, i.e. the
noise dispersion of $M_\mathrm{ap}(\theta)$ in a single aperture is about
$0.016\,(\theta/1\,\mathrm{arcmin})^{-1}$.
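
The conversion between a threshold in $M_\mathrm{ap}$ and a signal-to-noise threshold can be checked with a short numerical sketch. It rests on two assumptions that are not spelled out in this section: the weight function is taken to be the polynomial $Q(\vartheta) = 6x^2(1-x^2)/(\pi\theta^2)$ with $x = \vartheta/\theta$, and the shape noise of $M_\mathrm{ap}$ in one aperture is taken to be $\sigma_\mathrm{c}^2 = \sigma_\epsilon^2/(2n)\int\mathrm{d}^2\vartheta\,Q^2(\vartheta)$. Under these assumptions the noise dispersion comes out close to $0.016\,(\theta/1\,\mathrm{arcmin})^{-1}$, and $S_\mathrm{c} = M_\mathrm{ap}/\sigma_\mathrm{c}$. All function names are illustrative.

```python
import math


def q_weight(x: float) -> float:
    """Assumed polynomial weight, theta^2 * Q, with x = vartheta / theta in [0, 1]."""
    return (6.0 / math.pi) * x**2 * (1.0 - x**2)


def noise_dispersion(theta_arcmin: float, n_gal: float = 30.0,
                     sigma_eps: float = 0.2, steps: int = 20000) -> float:
    """Shape-noise dispersion of M_ap in one aperture of radius theta (arcmin)."""
    dx = 1.0 / steps
    integral = 0.0
    for i in range(steps):                 # midpoint rule over 0 <= x <= 1
        x = (i + 0.5) * dx
        integral += 2.0 * math.pi * x * q_weight(x) ** 2 * dx
    integral /= theta_arcmin**2            # Q^2 carries theta^-4, the area theta^2
    return sigma_eps * math.sqrt(integral / (2.0 * n_gal))


def signal_to_noise(m_ap: float, theta_arcmin: float) -> float:
    """Signal-to-noise of a single aperture-mass value."""
    return m_ap / noise_dispersion(theta_arcmin)


if __name__ == "__main__":
    print(f"sigma_c(1') = {noise_dispersion(1.0):.4f}")
    print(f"S_c for M_ap = 0.1 at theta = 2': {signal_to_noise(0.1, 2.0):.1f}")
```

Because the noise falls as $1/\theta$, a fixed $M_\mathrm{ap}$ is detected with higher significance in larger apertures.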
For the redshift distribution (2.69, page 35) with $\beta = 3/2$ and $z_0 = 1$, the number
density of haloes with $S_\mathrm{c} > 5$ exceeds 10 per square degree for
cluster-normalised cosmologies, across angular scales $1' \lesssim \theta \lesssim 10'$,
and these haloes have a broad redshift distribution which peaks at $z \sim 0.3$. This
implies that a wide-field imaging survey should be able to detect a statistically
interesting sample of medium redshift haloes, thus allowing the definition of a
mass-selected sample of haloes. Such a sample will be of utmost interest for cosmology,
since the halo abundance is considered to be one of the most sensitive cosmological
probes (e.g., Eke et al. 1996, Bahcall & Fan 1998). Current attempts to apply this tool
are hampered by the fact that haloes are selected either by their X-ray properties or by
their galaxy content. These properties are much more difficult to predict than the dark
matter distribution of haloes, which can directly be determined from cosmological
$N$-body simulations. Thus, these mass-selected haloes will provide a much closer link to
cosmological predictions than currently possible. Kruse & Schneider (1999b) estimated
that an imaging survey of several square degrees will allow one to distinguish between
the cosmological models given in Table 1, owing to the different number density of
haloes that they predict. Using the aperture-mass statistics, Erben et al. (1999)
recently detected a highly significant matter concentration on two independent
wide-field images centred on the galaxy cluster A 1942. This matter concentration $7'$
south of A 1942 is not associated with an overdensity of bright foreground galaxies,
which sets strong lower limits on the mass-to-light ratio of this putative cluster.

6.8 Cosmic Shear and Biasing

Up to now, we have only considered the mass properties of the large-scale structure 
and tried to measure them with weak lensing techniques. An interesting question 
arises when the luminous constituents of the Universe are taken into account. Most 
importantly, the galaxies are supposed to be strongly tied to the distribution of 
dark matter. In fact, this assumption underlies all attempts to determine the power 
spectrum of cosmic density fluctuations from the observed distribution of galaxies. 
The relation between the galaxy and dark-matter distributions is parameterised by
the so-called biasing factor $b$ (Kaiser 1984), which is defined such that the relative
fluctuations in the spatial number density of galaxies are $b$ times the relative density
fluctuations $\delta$,

$$
\frac{n(\vec x)-\langle n\rangle}{\langle n\rangle} = b\,\delta(\vec x) ,
\tag{6.64}
$$

where $\langle n\rangle$ denotes the mean spatial number density of galaxies at the given
redshift. The bias factor $b$ is not really a single number, but generally depends on
redshift, on the spatial scale, and on the galaxy type (see, e.g., Efstathiou 1996,
Peacock 1997, Kauffmann et al. 1997, Coles et al. 1998). Typical values for the bias
factor are assumed to be $b \sim 1$ to $2$ at the current epoch, but can increase towards
higher redshifts. The clustering properties of UV dropout galaxies (Steidel et al. 1998)
indicate that $b$ can be as large as 5 at redshifts $z \sim 3$, depending on the cosmology.



The projected surface mass density $\kappa_\mathrm{eff}(\vec\theta)$ should therefore be
correlated with the number density of (foreground) galaxies in that direction. Let
$G_\mathrm{G}(w)$ be the distribution function of a suitably chosen population of galaxies
in comoving distance (which can be readily converted to a redshift probability
distribution). Then, assuming that $b$ is independent of scale and redshift, the number
density of the galaxies is

$$
n_\mathrm{G}(\vec\theta) = \langle n_\mathrm{G}\rangle
\left[1 + b\int \mathrm{d}w\,G_\mathrm{G}(w)\,\delta(f_K(w)\vec\theta,w)\right] ,
\tag{6.65}
$$



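where $\langle n_\mathrm{G}\rangle$ is the mean number density of the galaxy population.
The distribution function $G_\mathrm{G}(w)$ depends on the selection of galaxies. For
example, for a flux-limited sample it may be of the form (2.69). Narrower distribution
functions can be achieved by selecting galaxies in multi-colour space using photometric
redshift techniques. The correlation function between $n_\mathrm{G}(\vec\theta)$ and
$\kappa_\mathrm{eff}(\vec\theta)$ can directly be obtained from eq. (2.83) by identifying
$q_1(w) = 3H_0^2\,\Omega_0\,\bar W(w)\,f_K(w)/[2c^2a(w)]$ [see eq. (6.18)], and
$q_2(w) = \langle n_\mathrm{G}\rangle\,b\,G_\mathrm{G}(w)$. It reads

$$
\xi_{\mathrm{G}\kappa}(\theta) \equiv \langle n_\mathrm{G}\,\kappa_\mathrm{eff}\rangle(\theta)
= \frac{3H_0^2\,\Omega_0}{2c^2}\,b\,\langle n_\mathrm{G}\rangle
\int \mathrm{d}w\,\frac{\bar W(w)\,f_K(w)\,G_\mathrm{G}(w)}{a(w)}
\times \int\frac{\mathrm{d}k\,k}{2\pi}\,P_\delta(k,w)\,J_0(f_K(w)\,\theta\,k) .
\tag{6.66}
$$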

Similar equations were derived by, e.g., Kaiser (1992), Bartelmann (1995b),
Dolag & Bartelmann (1997), and Sanz et al. (1997).

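One way to study the correlation between foreground galaxies and the projected
density field consists in correlating the aperture mass $M_\mathrm{ap}(\theta)$ with a
similarly filtered galaxy number density, defined as

$$
\mathcal{N}(\theta) = \int \mathrm{d}^2\vartheta\,U(|\vec\vartheta|)\,n_\mathrm{G}(\vec\vartheta) ,
\tag{6.67}
$$

with the same filter function $U$ as in $M_\mathrm{ap}$. The correlation between
$M_\mathrm{ap}(\theta)$ and $\mathcal{N}(\theta)$ then becomes

$$
\xi(\theta) \equiv \langle M_\mathrm{ap}(\theta)\,\mathcal{N}(\theta)\rangle
= \int \mathrm{d}^2\vartheta\,U(|\vec\vartheta|)\int \mathrm{d}^2\vartheta'\,U(|\vec\vartheta'|)\,
\xi_{\mathrm{G}\kappa}(|\vec\vartheta-\vec\vartheta'|)
= 3\pi\left(\frac{H_0}{c}\right)^2\Omega_0\,b\,\langle n_\mathrm{G}\rangle
\int \mathrm{d}w\,\frac{\bar W(w)\,G_\mathrm{G}(w)}{a(w)\,f_K(w)}
\int \mathrm{d}l\,l\,P_\delta\!\left(\frac{l}{f_K(w)},w\right) J^2(l\theta) ,
\tag{6.68}
$$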

where we used eq. (2.83) for the correlation function $\xi_{\mathrm{G}\kappa}$ in the final
step. The filter function $J$ is defined in eq. (6.49). Note that this correlation
function filters out the power spectrum $P_\delta$ at redshifts where the foreground
galaxies are situated. Thus, by selecting galaxy populations with narrow redshift
distribution, one can study the cosmological evolution of the power spectrum or, more
accurately, the product of the power spectrum and the bias factor.

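The convenient property of this correlation function is that one can define an unbiased
estimator for $\xi$ in terms of observables. If $N_\mathrm{b}$ galaxies are found in an
aperture of radius $\theta$ at positions $\vec\vartheta_i$ with tangential ellipticity
$\epsilon_{\mathrm{t}i}$, and $N_\mathrm{f}$ foreground galaxies at positions
$\vec\varphi_k$, then

$$
\tilde\xi(\theta) = \frac{\pi\theta^2}{N_\mathrm{b}}
\sum_{i=1}^{N_\mathrm{b}} Q(|\vec\vartheta_i|)\,\epsilon_{\mathrm{t}i}
\sum_{k=1}^{N_\mathrm{f}} U(|\vec\varphi_k|)
\tag{6.69}
$$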

is an unbiased estimator for $\xi(\theta)$. Schneider (1998) calculated the noise
properties of this estimator, concentrating on an Einstein-de Sitter model and a linearly
evolving power spectrum which can locally be approximated by a power law in $k$.
A more general and thorough treatment is given in Van Waerbeke (1998), where
various cosmological models and the non-linear power spectrum are considered.
Van Waerbeke (1998) assumed a broad redshift distribution for the background
galaxies, but a relatively narrow redshift distribution for the foreground galaxies,
with $\Delta z_\mathrm{d}/z_\mathrm{d} \sim 0.3$. For an open model with $\Omega_0 = 0.3$,
$\xi(\theta)$ declines much faster with $\theta$ than for flat models, implying that open
models have relatively more power on small scales at intermediate redshift. This is a
consequence of the behaviour of the growth factor $D_+(w)$; see Fig. 6 on page 27. For
foreground redshifts $z_\mathrm{d} \gtrsim 0.2$, the signal-to-noise ratio of the
estimator (6.69) for a single aperture is roughly constant for $\theta \gtrsim 5'$, and
relatively independent of the exact value of $z_\mathrm{d}$ over a broad redshift
interval, with a characteristic value of $\sim 0.4$.
Van Waerbeke (1998) also considered the ratio

$$
R \equiv \frac{\xi(\theta)}{\langle M_\mathrm{ap}^2(\theta)\rangle}
\tag{6.70}
$$

and found that it is nearly independent of $\theta$. This result was shown in
Schneider (1998) to hold for linearly evolving power spectra with power-law shape,
but surprisingly it also holds for the fully non-linear power spectrum. Indeed, varying
$\theta$ between $1'$ and $100'$, $R$ varies by less than 2% for the models considered in
Van Waerbeke (1998). This is an extremely important result, in that any observed
variation of $R$ with angular scale indicates a corresponding scale dependence of the
bias factor $b$. A direct observation of this variation would provide valuable
constraints on the models for the formation and evolution of galaxies.
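
The estimator for $\xi(\theta)$ discussed above can be sketched for a single aperture centred on the origin. The sketch assumes the form $\tilde\xi = (\pi\theta^2/N_\mathrm{b})\sum_i Q(|\vec\vartheta_i|)\,\epsilon_{\mathrm{t}i}\sum_k U(|\vec\varphi_k|)$ and a particular compensated filter pair, $U = 9(1-x^2)(1/3-x^2)/(\pi\theta^2)$ and $Q = 6x^2(1-x^2)/(\pi\theta^2)$ with $x = \vartheta/\theta$; this filter choice and all function names are assumptions made for illustration.

```python
import numpy as np


def u_filter(r, theta):
    """Assumed compensated filter U; vanishes outside the aperture radius theta."""
    x = np.asarray(r, dtype=float) / theta
    inside = 9.0 / (np.pi * theta**2) * (1.0 - x**2) * (1.0 / 3.0 - x**2)
    return np.where(x <= 1.0, inside, 0.0)


def q_filter(r, theta):
    """Assumed tangential-shear weight Q belonging to the filter U above."""
    x = np.asarray(r, dtype=float) / theta
    inside = 6.0 / (np.pi * theta**2) * x**2 * (1.0 - x**2)
    return np.where(x <= 1.0, inside, 0.0)


def xi_estimate(bg_pos, bg_eps_t, fg_pos, theta):
    """Product of the aperture-mass estimate and the filtered foreground counts.

    bg_pos, fg_pos: (N, 2) positions relative to the aperture centre;
    bg_eps_t: tangential ellipticities of the background galaxies.
    """
    bg_pos = np.atleast_2d(np.asarray(bg_pos, dtype=float))
    fg_pos = np.atleast_2d(np.asarray(fg_pos, dtype=float))
    bg_eps_t = np.atleast_1d(np.asarray(bg_eps_t, dtype=float))
    bg_r = np.hypot(bg_pos[:, 0], bg_pos[:, 1])
    fg_r = np.hypot(fg_pos[:, 0], fg_pos[:, 1])
    in_aperture = bg_r <= theta
    n_b = int(np.count_nonzero(in_aperture))
    if n_b == 0:
        raise ValueError("no background galaxies inside the aperture")
    m_ap = np.pi * theta**2 / n_b * np.sum(
        q_filter(bg_r[in_aperture], theta) * bg_eps_t[in_aperture])
    filtered_counts = np.sum(u_filter(fg_r, theta))
    return float(m_ap * filtered_counts)
```

In practice the estimate would be averaged over many apertures; a single aperture has a signal-to-noise ratio below unity, as noted above.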
6.9 Numerical Approach to Cosmic Shear, Cosmological Parameter Estimates, and Observations

6.9.1 Cosmic Shear Predictions from Cosmological Simulations

So far, we have treated the lensing effect of the large-scale structure with analytic
means. This was possible because of two assumptions. First, we considered only the
lowest-order lensing effect, by employing the Born approximation and neglecting
lens-lens coupling in going from eq. (6.9) to eq. (6.11). Second, we used the
prescription for the non-linear power spectrum as given by Peacock & Dodds (1996),
assuming that it is a sufficiently accurate approximation. Both of these approximations
may become less accurate on small angular scales. Being a two-point quantity, the
analytic approximation of $P_\kappa$ is applicable only to two-point statistical
measures of cosmic shear. In addition, the error introduced with these approximations
cannot be controlled, i.e., we cannot attach 'error bars' to the analytic results.

A practical way to avoid these approximations is to study the propagation of light
in a model universe which is generated by cosmological structure-formation simulations.
They typically provide the three-dimensional mass distribution at different
redshifts in a cube whose side-length is much smaller than the Hubble radius. The
mass distribution along a line-of-sight can be generated by combining adjacent
cubes from a sequence of redshifts. The cubes at different redshifts should either
be taken from different realisations of the initial conditions, or, if this requires too
much computing time, they should be translated and rotated so as to avoid periodicity
along the line-of-sight. The mass distribution in each cube can then be
projected along the line-of-sight, yielding a surface mass density distribution at
that redshift. Finally, by employing the multiple lens-plane equations, which are a
discretisation of the propagation equation (6.9; Seitz et al. 1994), shear and
magnification can be calculated along light rays within a cone whose size is determined by
the side length of the numerical cube. This approach was followed by many authors
(e.g., Jaroszynski et al. 1990, Jaroszynski 1991, Bartelmann & Schneider 1991,
Blandford et al. 1991, Waxman & Miralda-Escude 1995), but the rapid development of
$N$-body simulations of the cosmological dark matter distribution renders the more
recent studies particularly useful (Wambsganss et al. 1998,
van Waerbeke et al. 1999, Jain et al. 1999).
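
The cube-stacking step can be sketched as follows. This is a minimal illustration under stated assumptions, not the procedure of any of the cited papers: each simulation output is a periodic density cube stored as a 3-d array, "rotation" is restricted to axis permutations and reflections, "translation" is a periodic shift, and the line-of-sight is the last array axis. The function names are hypothetical.

```python
import numpy as np


def randomise_cube(cube: np.ndarray, rng: np.random.Generator) -> np.ndarray:
    """Randomly permute the axes, reflect, and periodically shift a density cube."""
    axes = tuple(int(a) for a in rng.permutation(3))
    out = np.transpose(cube, axes)                    # 90-degree re-orientation
    for axis in range(3):
        if rng.random() < 0.5:
            out = np.flip(out, axis=axis)             # reflection along this axis
    shifts = tuple(int(rng.integers(0, n)) for n in out.shape)
    return np.roll(out, shift=shifts, axis=(0, 1, 2))  # periodic translation


def surface_density_planes(cubes, seed: int = 0):
    """Project each randomised cube along the last axis to obtain one lens plane."""
    rng = np.random.default_rng(seed)
    return [randomise_cube(cube, rng).sum(axis=2) for cube in cubes]


if __name__ == "__main__":
    demo_rng = np.random.default_rng(1)
    demo_cubes = [demo_rng.random((8, 8, 8)) for _ in range(3)]
    for plane in surface_density_planes(demo_cubes):
        print(plane.shape, float(plane.sum()))
```

The randomisation leaves the total mass in each cube unchanged; it only removes the artificial repetition of the same structures along the line-of-sight.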

As mentioned below eq. (6.30), the Jacobian matrix $\mathcal{A}$ is generally asymmetric
when the propagation equation is not simplified to (6.11). Therefore, the degree of
asymmetry of $\mathcal{A}$ provides one test for the accuracy of this approximation.
Jain et al. (1999) found that the power spectrum of the asymmetric component is
at least three orders of magnitude smaller than that of $\kappa_\mathrm{eff}$. For a
second test, we have seen that the power spectrum of $\kappa_\mathrm{eff}$ should equal
that of the shear in the frame of our approximations. This analytic prediction is very
accurately satisfied in the numerical simulations.

Jain et al. (1999) and Reblinsky et al. (1999) found that analytic predictions of the
dispersions of $\kappa$ and $M_\mathrm{ap}$, respectively, are very accurate when compared
to numerical results. For both cosmic shear measures, however, the analytic predictions
of the skewness are not satisfactory on angular scales below $\sim 10'$. This discrepancy
reflects the limited accuracy of the second-order Eulerian perturbation theory employed in
deriving the analytic results. Hui (1999) showed that the accuracy of the analytic
predictions can be much increased by using a prescription for the highly non-linear
three-point correlation function of the cosmic density contrast, as developed by
Scoccimarro & Frieman (1999).

The signal-to-noise ratio of the dispersion of the cosmic shear, given explicitly
for $M_\mathrm{ap}$ in eq. (6.57), is determined by the intrinsic ellipticity dispersion
of galaxies and the sampling variance, expressed in terms of the kurtosis. As shown in
van Waerbeke et al. (1999) and Reblinsky et al. (1999), this kurtosis is remarkably
large. For instance, the kurtosis of the aperture mass exceeds unity even on scales
larger than $10'$, revealing non-Gaussianity on such large scales. Unfortunately, this
large sampling variance implies not only that the area over which cosmic shear
needs to be measured to achieve a given accuracy for its dispersion must be
considerably larger than estimated for a Gaussian density field, but also that numerical
estimates of cosmic shear quantities need to cover large solid angles for an accurate
numerical determination of the relevant quantities.

From such numerical simulations, one can not only determine moments of the
shear distribution, but also consider its full probability distribution. For example,
the predictions for the number density of dark matter haloes that can be detected
through highly significant peaks of $M_\mathrm{ap}$ (see Sect. 6.7.2) have been found by
Reblinsky et al. (1999) to be fairly accurate, perhaps surprisingly so, given the
assumptions entering the analytic results. Similarly, the extreme tail (say more than
5 standard deviations from the mean) of the probability distribution for
$M_\mathrm{ap}$, calculated analytically in Kruse & Schneider (1999a), agrees with the
numerical results; it decreases exponentially.



6.9.2 Cosmological Parameter Estimates

Since the cosmic shear described in this section directly probes the total matter
content of the universe, i.e., without any reference to the relation between mass and
luminosity, it provides an ideal tool to investigate the large-scale structure of the
cosmological density field. Assuming the dominance of cold dark matter, the
statistical properties of the cosmic mass distribution are determined by a few
parameters, the most important of which are $\Omega_0$, $\Omega_\Lambda$, the shape
parameter of the power spectrum, $\Gamma$, and the normalisation of the power spectrum
expressed in terms of $\sigma_8$. For each set of these parameters, the corresponding
cosmic shear signals can be predicted, and a comparison with observations then
constrains the cosmological parameters.

Several approaches to this parameter estimation have been discussed in the
literature. For example, van Waerbeke et al. (1999) used numerical simulations to
generate synthetic cosmic shear data, fixing the normalisation of the density
fluctuations to $\sigma_8\,\Omega_0 = 0.6$, which is essentially the normalisation by
cluster abundance. A moderately wide and deep weak-lensing survey, covering 25 square
degrees and reaching a number density of 30 galaxies per arcmin$^2$ with characteristic
redshift $z_\mathrm{s} \approx 1$, will enable the distinction between an Einstein-de
Sitter model and an open universe with $\Omega_0 = 0.3$ at the 6-$\sigma$ level, though
each of these models is degenerate in the $\Omega_0$ vs. $\Omega_\Lambda$ plane. For this
conclusion, only the skewness of the reconstructed effective surface mass density or the
aperture mass was used. Kruse & Schneider (1999a) instead considered the highly
non-Gaussian tail of the aperture mass statistics to constrain cosmological parameters,
whereas Kruse & Schneider (1999b) considered the abundance of highly significant peaks
of $M_\mathrm{ap}$ as a probe of the cosmological models. The peak statistics of
reconstructed surface density maps (Jain & van Waerbeke 1999) also provides a valuable
means to distinguish between various cosmological models.

Future work will also involve additional information on the redshifts of the
background galaxies. Hu (1999) pointed out that splitting up the galaxy sample into
several redshift bins substantially increases the ability to constrain cosmological
parameters. He considered the power spectrum of the projected density and found
that the accuracy of the corresponding cosmological parameters improves by a
factor of $\sim 7$ for $\Omega_\Lambda$, and by a factor of $\sim 3$ for $\Omega_0$,
estimated for a median redshift of unity.

All of the quoted work concentrated mainly on one particular measure of cosmic
shear. One goal of future theoretical investigations will certainly be the
construction of a method which combines the various measures into a 'global' statistic,
designed to minimise the volume of parameter space allowed by the data of future
observational weak lensing surveys. Future, larger-scale numerical simulations will
guide the search for such a statistic and allow one to make accurate predictions.

In addition to a pure cosmic shear investigation, cosmic shear constraints can be 
used in conjunction with other measures of cosmological parameters. One impres- 
sive example has been given by Hu & Tegmark (1999), who showed that even a 
relatively small weak lensing survey could dramatically improve the accuracy of 
cosmological parameters measured by future Cosmic Microwave Background mis- 
sions. 
6.9.3 Observations



We are not aware of any convincing and cosmologically useful measurement of cosmic
shear yet obtained. One of the first attempts was reported in
Mould et al. (1994), where the mean shear was investigated across a field of
$9'.6 \times 9'.6$, observed with the Hale 5-meter Telescope. The image is very deep and
has good quality (i.e., a seeing of FWHM). These are the same data as used by
Brainerd et al. (1996) for the first detection of galaxy-galaxy lensing (see Sect. 8).
The mean ellipticity of the 4363 galaxies within a circle of $4'.8$ radius with
magnitudes $23 < r < 26$ was found to be $(0.5 \pm 0.5)\%$. A later, less conservative
reanalysis of these data by Villumsen (unpublished), where an attempt was made to account
for the seeing effects, yielded a 3-$\sigma$ detection of a non-vanishing mean ellipticity.

Following the suggestion that the observed large-angle QSO-galaxy associations
are due to weak lensing by the large-scale structure in which the foreground galaxies
are embedded (see Sect. 7), Fort et al. (1996) searched for shear around five
luminous radio quasars. In one of the fields, the number density of stars was so
high that no reasonable shear measurement on faint background galaxies could be
performed. In the remaining four QSO fields, they found a shear signal on a
scale of $\sim 1'$ for three of the QSOs (those which were observed with SUSI, which
has a field-of-view of $\sim 2'.2$), and on a somewhat larger angular scale for the fourth
QSO. Taken at face value, these observations support the suggestion of magnification
bias caused by the large-scale structure. A reanalysis of the three SUSI fields
by Schneider et al. (1998b), considering the rms shear over the fields, produced a
positive value for $\langle|\gamma|^2\rangle$ at the 99% significance level, as determined
by numerous simulations randomising the orientation angles of the galaxy ellipticities.
The amplitude of the rms shear, when corrected for the dilution by seeing, is of the same
magnitude as expected from cluster-normalised models. However, if the magnification
bias hypothesis is true, these three lines-of-sight are not randomly selected,
and therefore this measurement is of no cosmological use.

Of course, one or a few narrow-angle fields cannot be useful for a measurement
of cosmic shear, owing to cosmic variance. Therefore, a meaningful measurement
of cosmic shear must either include many small fields, or must be obtained from
a wide-field survey. Using the first strategy, several projects are under way: The
Hubble Space Telescope has been carrying out so-called parallel surveys, where
one or more of the instruments not used for primary observations are switched on
to obtain data of a field located a few arc minutes away from the primary pointing.
Over the past few years, a considerable database of such parallel data sets has
accumulated. Two teams are currently analysing parallel data sets taken with WFPC2
and STIS, respectively (see Seitz et al. 1998a, Rhodes et al. 1999). In addition, a
cosmic-shear survey is currently under way, in which randomly selected areas of
the sky are mapped with the FORS instrument (~ x 6'.1) on the VLT. Some of
these areas include the fields from the STIS parallel survey.

(Footnote: This field was subsequently used to demonstrate the superb image quality of
the SUSI instrument on the ESO NTT.)

The alternative approach is to map big areas and measure the cosmic shear on a 
wide range of scales. The wide-field cameras currently being developed and in- 
stalled are ideally suited for this purpose, and several groups are actively engaged 
in this work (see the proceedings of the Boston lens conference, July 1999). At 
present, no conclusive results are available, which is perhaps not too surprising 
given the smallness of the expected effect, the infancy of the research area, and the 
relatively small amount of high-quality data collected and analysed so far. Never- 
theless, upper limits on the cosmic shear have been derived by several groups which 
apparently exclude a COBE-normalised SCDM model. 

There is nothing special about weak lensing being carried out predominantly in 
the optical wavelength regime, except that the optical sky is full of faint extended 
sources, whereas the radio sky is relatively empty. The FIRST radio survey covers 
at present about 4200 square degrees and contains 4 x 10^ sources, i.e., the number 
density is smaller by about a factor ~ 1000 than in deep optical images. However, 
this radio survey covers a much larger solid angle than current or foreseeable deep 
optical surveys. As discussed in Refregier et al. (1998), this survey may yield a 
significant measurement of the two-point correlation function of image ellipticities 
on angular scales > 10'. On smaller angular scales, sources with intrinsic double- 
lobe structure cannot be separated from individual independent sources. The Square 
Kilometer Array (van Haarlem & van der Hulst 1999) currently being discussed 
will yield such a tremendous increase in sensitivity for cm-wavelength radio astron- 
omy that the radio sky will then be as crowded as the current optical sky. Finally, 
the recently commissioned Sloan telescope will map a quarter of the sky in five 
colours. Although the imaging survey will be much shallower than current
weak-lensing imaging, the huge area surveyed can compensate for the reduced galaxy
number density and their smaller mean redshift (Stebbins et al. 1996). Indeed, first
weak-lensing results were already reported at the Boston lensing conference (July
1999) from commissioning data of the telescope (see also Fischer et al. 1999).

7 QSO Magnification Bias and Large-Scale Structure



7.1 Introduction 



Magnification by gravitational lenses is a purely geometrical phenomenon. The
solid angle spanned by the source is enlarged, or equivalently, gravitational
focusing directs a larger fraction of the energy radiated by the source to the observer.
Sources that would have been too faint without magnification can therefore be seen 
in a flux-limited sample. However, these sources are now distributed over a larger 
patch of the sky because the solid angle is stretched by the lens, so that the number 
density of the sources on the sky is reduced. The net effect on the number density 
depends on how many sources are added to the sample because they appear brighter. 
If the number density of sources increases steeply with decreasing flux, many more 
sources appear due to a given magnification, and the simultaneous dilution can be 
compensated or outweighed. 

This magnification bias was described in Sect. 4.4.1 (page 70) and quantified in
eq. (4.38). As introduced there, let $\mu(\vec\theta)$ denote the magnification into
direction $\vec\theta$ on the sky, and $n_0(>S)$ the intrinsic counts of sources with
observed flux exceeding $S$. In the limit of weak lensing, $\mu(\vec\theta) \gtrsim 1$, and
the flux will not change by a large factor, so that it is sufficient to know the
behaviour of $n_0(>S)$ in a small neighbourhood of $S$. Without loss of generality, we
can assume the number-count function to be a power law in that neighbourhood,
$n_0(>S) \propto S^{-\alpha}$. We can safely ignore any redshift dependence of the
intrinsic source counts here because we aim at lensing effects of moderate-redshift
mass distributions on high-redshift sources. Equation (4.43, page 71) then applies,
which relates the cumulative source counts $n(>S,\vec\theta)$ observed in direction
$\vec\theta$ to the intrinsic source counts,

$$ n(>S,\vec\theta) = \mu^{\alpha-1}(\vec\theta)\, n_0(>S) . \tag{7.1} $$

Hence, if $\alpha > 1$, the observed number density of objects is increased by lensing,
and reduced if $\alpha < 1$. This effect is called magnification bias or magnification
anti-bias (e.g. Schneider et al. 1992).
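
The sign change of the bias at unit slope can be illustrated numerically. The following is a minimal sketch, not part of the original analysis: the function names are hypothetical, the magnification of 1.05 is an arbitrary weak-lensing value, and the three slopes are the faint-end and bright-end QSO slopes quoted in this section plus the neutral case.

```python
def overdensity_factor(mu, alpha):
    """Ratio n(>S) / n_0(>S) = mu**(alpha - 1), as in eq. (7.1)."""
    if mu <= 0.0:
        raise ValueError("magnification must be positive")
    return mu ** (alpha - 1.0)


def linearised_overdensity(mu, alpha):
    """First-order expansion 1 + (alpha - 1) * (mu - 1), valid for |mu - 1| << 1."""
    return 1.0 + (alpha - 1.0) * (mu - 1.0)


if __name__ == "__main__":
    mu = 1.05  # assumed illustrative magnification, not a value from the text
    for alpha in (0.64, 1.0, 2.52):
        exact = overdensity_factor(mu, alpha)
        approx = linearised_overdensity(mu, alpha)
        print(f"alpha = {alpha:4.2f}: exact = {exact:.4f}, linearised = {approx:.4f}")
```

For a slope below unity the factor falls below one (anti-bias), for a slope of exactly one it equals one, and for a steep slope it exceeds one. The linearised form is the approximation that enters the weak-lensing treatment of the number-density fluctuations.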

The intrinsic number-count function of QSOs is well fit by a broken power law with
a slope of $\alpha \sim 0.64$ for QSOs fainter than ~ 19th blue magnitude, and a steeper
slope of $\alpha \sim 2.52$ for brighter QSOs (Boyle et al. 1988; Hartwick & Schade 1990;
Pei 1995). Faint QSOs are therefore anti-biased by lensing, and bright QSOs are
biased. In the neighbourhood of gravitational lenses, the number density of bright
QSOs is thus expected to be higher than average, in other words, more bright QSOs
should be observed close to foreground lenses than expected without lensing.
According to eq. (7.1), the overdensity factor is

$$ q(\vec\theta) = \frac{n(>S,\vec\theta)}{n_0(>S)} = \mu^{\alpha-1}(\vec\theta) . \tag{7.2} $$

If the lenses are individual galaxies, the magnification $\mu(\vec\theta)$ drops rapidly
with increasing distance from the lens. The natural scale for the angular separation
is the Einstein radius, which is of order an arc second for galaxies. Therefore,
individual galaxies are expected to increase the number density of bright QSOs only
in a region of radius a few arc seconds around them.

Fugmann (1990) reported an observation which apparently contradicts this expectation.
He correlated bright, radio-loud QSOs at moderate and high redshifts with galaxies from
the Lick catalogue (Seldner et al. 1977) and found that there is a significant
overdensity of galaxies around the QSOs of some of his sub-samples. This is intriguing
because the Lick catalogue contains the counts of galaxies brighter than ~ 19th
magnitude in square-shaped cells with 10' side length. Galaxies of < 19th magnitude are
typically at much lower redshifts than the QSOs, $z \lesssim 0.1 - 0.2$, so that the QSOs
with redshifts $z \gtrsim 0.5 - 1$ are in the distant background of the galaxies, with the
two samples separated by hundreds of megaparsecs. Physical correlations between the
QSOs and the galaxies are clearly ruled out. Can the observed overdensity be
expected from gravitational lensing? By construction, the angular resolution of the
Lick catalogue is of order 10', exceeding the Einstein radii of individual galaxies
by more than two orders of magnitude. The result that Lick galaxies are correlated
with bright QSOs can thus neither be explained by physical correlations nor by
gravitational lensing due to individual galaxies.

On the other hand, the angular scale of ~ 10' is of the right order of magnitude for
lensing by large-scale structures. The question therefore arises whether the
magnification due to lensing by large-scale structures is sufficient to cause a magnification
bias in flux-limited QSO samples which is large enough to explain the observed 
QSO-galaxy correlation. The idea is that QSOs are then expected to appear more 
abundantly behind matter overdensities. More galaxies are expected where the mat- 
ter density is higher than on average, and so the galaxies would act as tracers for 
the dark material responsible for the lensing magnification. This could then cause 
foreground galaxies to be overdense around background QSOs. This exciting pos- 
sibility clearly deserves detailed investigation. 

Even earlier than Fugmann, Tyson (1986) had inferred that galaxies apparently un- 
derwent strong luminosity evolution from a detection of significant galaxy over- 
densities on scales of 30" around 42 QSOs with redshifts 1 < z < 1.5, assuming 
that the excess galaxies were at the QSO redshifts. In the light of later observations 
and theoretical studies, he probably was the first to detect weak-lensing induced 
associations of distant sources with foreground galaxies.

7.2 Expected Magnification Bias from Cosmological Density Perturbations

To estimate the magnitude of the effect, we now calculate the angular
cross-correlation function $\xi_{\rm QG}(\phi)$ between background QSOs and foreground
galaxies expected from weak lensing due to large-scale structures (Bartelmann 1995b;
Dolag & Bartelmann 1997; Sanz et al. 1997). We employ a simple picture for
the relation between the number density of galaxies and the density contrast
of dark matter, the linear biasing scheme (e.g. Kaiser 1984; Bardeen et al. 1986;
White et al. 1987). Within this picture, and assuming weak lensing, we shall
immediately see that the desired correlation function $\xi_{\rm QG}$ is proportional to the
cross-correlation function $\xi_{\mu\delta}$ between magnification $\mu$ and density contrast
$\delta$. The latter correlation can straightforwardly be computed with the techniques
developed previously.



7.2.1 QSO-Galaxy Correlation Function 

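The angular cross-correlation function $\xi_{\rm QG}(\phi)$ between galaxies and QSOs is
defined by

$$ \xi_{\rm QG}(\phi) = \frac{1}{\langle n_{\rm Q}\rangle \langle n_{\rm G}\rangle}
   \left\langle \left[ n_{\rm Q}(\vec\theta) - \langle n_{\rm Q}\rangle \right]
   \left[ n_{\rm G}(\vec\theta+\vec\phi) - \langle n_{\rm G}\rangle \right] \right\rangle , \tag{7.3} $$

where $\langle n_{\rm Q,G}\rangle$ are the mean number densities of QSOs and galaxies averaged
over the whole sky. Assuming isotropy, $\xi_{\rm QG}(\phi)$ does not depend on the direction
of the lag angle $\vec\phi$. All number densities depend on flux (or galaxy magnitude),
but we leave out the corresponding arguments for brevity.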

We saw in eq. (7.1) in the introduction that
$n_{\rm Q}(\vec\theta) = \mu^{\alpha-1}(\vec\theta)\,\langle n_{\rm Q}\rangle$. Since the
magnification expected from large-scale structures is small, $\mu = 1 + \delta\mu$ with
$|\delta\mu| \ll 1$, we can expand $\mu^{\alpha-1} \approx 1 + (\alpha-1)\,\delta\mu$. Hence,
we can approximate

$$ \frac{n_{\rm Q}(\vec\theta) - \langle n_{\rm Q}\rangle}{\langle n_{\rm Q}\rangle}
   \approx (\alpha-1)\,\delta\mu(\vec\theta) , \tag{7.4} $$

so that the relative fluctuation of the QSO number density is proportional to the
magnification fluctuation, and the factor of proportionality quantifies the
magnification bias. Again, for $\alpha = 1$, lensing has no effect on the number density.

The linear biasing model for the fluctuations in the galaxy density asserts that the
relative fluctuations in the galaxy number counts are proportional to the density
contrast $\delta$,

$$ \frac{n_{\rm G}(\vec\theta) - \langle n_{\rm G}\rangle}{\langle n_{\rm G}\rangle}
   = b\,\bar\delta(\vec\theta) , \tag{7.5} $$

where $\bar\delta(\vec\theta)$ is the line-of-sight integrated density contrast, weighted by
the galaxy redshift distribution, i.e. the $w$-integral in eq. (6.65), page 149. The
proportionality factor $b$ is the effective biasing factor appropriately averaged over
the line-of-sight. Typical values for the biasing factor are assumed to be
$b \gtrsim 1 - 2$. Both the relative fluctuations in the galaxy number density and the
density contrast are bounded by $-1$ from below, so that the right-hand side should be
replaced by $\max[b\,\bar\delta(\vec\theta), -1]$ in places where
$\bar\delta(\vec\theta) < -b^{-1}$. For simplicity we use (7.5), keeping this limitation
in mind.

Using eqs. (7.4) and (7.5), the QSO-galaxy cross-correlation function (7.3)
becomes

$$ \xi_{\rm QG}(\phi) = (\alpha - 1)\, b\,
   \left\langle \delta\mu(\vec\theta)\, \bar\delta(\vec\theta+\vec\phi) \right\rangle . \tag{7.6} $$

Hence, it is proportional to the cross-correlation function between magnification
and density contrast, and the proportionality factor is given by the steepness
of the intrinsic QSO number counts and the bias factor (Bartelmann 1995b). As
expected from the discussion of the magnification bias, the magnification bias is
ineffective for $\alpha = 1$, and QSOs and galaxies are anti-correlated for $\alpha < 1$.
Furthermore, if the number density of galaxies does not reflect the dark-matter
fluctuations, $b$ would vanish, and the correlation would disappear. In order to find the
QSO-galaxy cross-correlation function, we therefore have to evaluate the angular
cross-correlation function between magnification and density contrast.
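
The proportionality in eq. (7.6) can be checked with a small Monte-Carlo experiment. The sketch below is only an illustration under stated assumptions: the magnification and density fluctuations are drawn as correlated Gaussian numbers with toy amplitudes (0.02 and 0.05, correlation coefficient 0.5) that are not taken from the text, all helper names are hypothetical, and only the zero-lag correlation is estimated.

```python
import random


def mean(values):
    return sum(values) / len(values)


def covariance(a, b):
    """Sample covariance of two equally long sequences."""
    mean_a, mean_b = mean(a), mean(b)
    return sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b)) / len(a)


def simulate_cross_correlation(alpha, bias, n_samples=100_000, seed=1):
    """Estimate xi_QG(0) and xi_mu_delta(0) from toy Gaussian fluctuations."""
    rng = random.Random(seed)
    sigma_mu, sigma_delta, rho = 0.02, 0.05, 0.5  # toy values (assumed)
    q_counts, g_counts, d_mu, d_delta = [], [], [], []
    for _ in range(n_samples):
        x, y = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
        dm = sigma_mu * x
        dd = sigma_delta * (rho * x + (1.0 - rho ** 2) ** 0.5 * y)
        d_mu.append(dm)
        d_delta.append(dd)
        # QSO counts follow eq. (7.1), in units of the unlensed density
        q_counts.append((1.0 + dm) ** (alpha - 1.0))
        # galaxy counts follow eq. (7.5), bounded from below
        g_counts.append(max(1.0 + bias * dd, 0.0))
    xi_qg = covariance(q_counts, g_counts) / (mean(q_counts) * mean(g_counts))
    xi_md = covariance(d_mu, d_delta)
    return xi_qg, xi_md


if __name__ == "__main__":
    for alpha in (0.64, 2.5):
        xi_qg, xi_md = simulate_cross_correlation(alpha, bias=1.5)
        print(alpha, xi_qg, (alpha - 1.0) * 1.5 * xi_md)
```

For small fluctuations the two printed columns should agree closely, and the estimate should turn negative for a slope below unity, in line with the anti-correlation stated above.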



7.2.2 Magnification-Density Correlation Function 

We have seen in Sect. 6 that the magnification fluctuation is twice the effective
convergence, $\delta\mu(\vec\theta) = 2\kappa_{\rm eff}(\vec\theta)$, in the limit of weak
lensing, see eq. (6.29, page 128). The latter is given by eq. (6.19, page 124), in which
the average over the source-distance distribution has already been performed.
Therefore, we can immediately write down the source-distance averaged magnification
fluctuation as

$$ \delta\bar\mu(\vec\theta) = \frac{3 H_0^2 \Omega_0}{c^2} \int_0^{w_{\rm H}} dw\,
   W_{\rm Q}(w)\, f_K(w)\, \frac{\delta[f_K(w)\vec\theta, w]}{a(w)} . \tag{7.7} $$

Here, $W_{\rm Q}(w)$ is the modified QSO weight function

$$ W_{\rm Q}(w) \equiv \int_w^{w_{\rm H}} dw'\, G_{\rm Q}(w')\,
   \frac{f_K(w'-w)}{f_K(w')} , \tag{7.8} $$

and $G_{\rm Q}(w)$ is the normalised QSO distance distribution.
Both the average density contrast $\bar\delta$ and the average magnification fluctuation
$\delta\bar\mu$ are weighted projections of the density fluctuations along the
line-of-sight, which is assumed to be a homogeneous and isotropic random field. As in
the derivation of the effective-convergence power spectrum in Sect. 6, we can once more
employ Limber's equation in Fourier space to find the cross power spectrum
$P_{\mu\delta}(l)$ for projected magnification and density contrast,

$$ P_{\mu\delta}(l) = \frac{3 H_0^2 \Omega_0}{c^2} \int_0^{w_{\rm H}} dw\,
   \frac{W_{\rm Q}(w)\, G_{\rm G}(w)}{a(w)\, f_K(w)}\,
   P_\delta\!\left( \frac{l}{f_K(w)}, w \right) . \tag{7.9} $$

The cross-correlation function between magnification and density contrast is
obtained from eq. (7.9) via Fourier transformation, which can be carried out and
simplified to yield

$$ \xi_{\mu\delta}(\phi) = \frac{3 H_0^2 \Omega_0}{c^2} \int_0^{w_{\rm H}} dw'\,
   f_K(w')\, W_{\rm Q}(w')\, G_{\rm G}(w')\, a^{-1}(w')
   \times \int_0^\infty \frac{k\,dk}{2\pi}\, P_\delta(k, w')\,
   {\rm J}_0[f_K(w')\, k\, \phi] . \tag{7.10} $$

Quite obviously, there is a strong similarity between this equation and that for the
magnification autocorrelation function, eq. (6.34, page 130). We note that eq. (7.10)
automatically accounts for galaxy autocorrelations through the matter power
spectrum $P_\delta(k)$.
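
The Fourier-transformation step that leads from an angular power spectrum to a correlation function is a zeroth-order Hankel transform, $\xi(\phi) = \int l\,dl\,P(l)\,{\rm J}_0(l\phi)/(2\pi)$. The sketch below illustrates that single step numerically. It is not the calculation of eq. (7.10): the Gaussian spectrum is a toy assumption chosen only because its transform is known in closed form, the function names are hypothetical, and the Bessel function is evaluated from its standard integral representation so that the example needs only the Python standard library.

```python
import math


def bessel_j0(x, n_steps=100):
    """J_0(x) from (1/pi) * int_0^pi cos(x sin t) dt, using the midpoint rule."""
    h = math.pi / n_steps
    total = sum(math.cos(x * math.sin((i + 0.5) * h)) for i in range(n_steps))
    return total * h / math.pi


def correlation_from_power(power, phi, l_max, n_l=1000):
    """xi(phi) = int_0^l_max l dl / (2 pi) P(l) J_0(l phi), trapezoidal rule."""
    dl = l_max / n_l
    total = 0.0
    for i in range(n_l + 1):
        l = i * dl
        weight = 0.5 if i in (0, n_l) else 1.0
        total += weight * l * power(l) * bessel_j0(l * phi)
    return total * dl / (2.0 * math.pi)


def toy_power(l, amplitude=1.0, sigma=1.0):
    """Gaussian toy spectrum (an assumption, not a cosmological model)."""
    return amplitude * math.exp(-0.5 * (sigma * l) ** 2)


def toy_correlation_exact(phi, amplitude=1.0, sigma=1.0):
    """Closed-form transform of toy_power."""
    return amplitude / (2.0 * math.pi * sigma ** 2) * math.exp(-0.5 * (phi / sigma) ** 2)


if __name__ == "__main__":
    for phi in (0.0, 0.5, 2.0):
        print(phi, correlation_from_power(toy_power, phi, l_max=10.0),
              toy_correlation_exact(phi))
```

In a realistic calculation the toy spectrum would be replaced by the cross power spectrum of eq. (7.9), which in turn requires a model for the matter power spectrum and the weight functions; with the default of 100 midpoint nodes the Bessel evaluation is only reliable for arguments well below 100.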



7.2.3 Distance Distributions and Weight Functions 

The QSO and galaxy weight functions $G_{\rm Q,G}(w)$ are normalised representations of
their respective redshift distributions, where the redshift needs to be transformed to
comoving distance $w$.

The redshift distribution of QSOs has frequently been measured and parameterised.
Using the functional form and the parameters determined by Pei (1995), the
modified QSO weight function $W_{\rm Q}(w)$ has the shape illustrated in the top panel of
Fig. 25. It is necessary for our present purposes to be able to impose a lower
redshift limit on the QSO sample. Since we want to study lensing-induced correlations
between background QSOs and foreground galaxies, there must be a way to
exclude QSOs physically associated with galaxy overdensities. This is observationally
achieved by choosing a lower QSO redshift cut-off high enough to suppress any
redshift overlap between the QSO and galaxy samples. This procedure must be
reproduced in theoretical calculations of the QSO-galaxy cross-correlation function.
This can be achieved by cutting off the observed redshift distribution $G_{\rm Q}$ below
some redshift $z_0$, re-normalising it, and putting the result into eq. (7.8) to find
$W_{\rm Q}$. The five curves shown in the top panel of Fig. 25 are for cut-off redshifts
$z_0$ increasing from 0.0 (solid curve) to 2.0 in steps of 0.5. Obviously, the peak in
$W_{\rm Q}$ shifts to larger $w$ for increasing $z_0$.
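
The cut-and-renormalise procedure can be sketched in a few lines. The example below is a simplified illustration under stated assumptions: it takes a spatially flat model so that the comoving angular-diameter distance equals the comoving distance, it imposes the cut-off directly in distance rather than in redshift, and it works on a tabulated distance distribution. The function names are hypothetical, and the distributions a user would pass in are placeholders rather than the Pei (1995) parameterisation.

```python
import math


def trapezoid(xs, ys):
    """Trapezoidal integral of tabulated ys over xs."""
    return sum(0.5 * (ys[i] + ys[i + 1]) * (xs[i + 1] - xs[i])
               for i in range(len(xs) - 1))


def modified_weight(w, w_grid, g_q, w_cut=0.0):
    """W_Q(w) of eq. (7.8) for f_K(w) = w (flat space, an assumption here).

    The tabulated distribution g_q is set to zero below w_cut and
    re-normalised to unit integral before the weight is evaluated.
    """
    g_cut = [g if x >= w_cut else 0.0 for x, g in zip(w_grid, g_q)]
    norm = trapezoid(w_grid, g_cut)
    integrand = [g / norm * (x - w) / x if x > w else 0.0
                 for x, g in zip(w_grid, g_cut)]
    return trapezoid(w_grid, integrand)


if __name__ == "__main__":
    grid = [i * 0.0005 for i in range(4001)]  # distances in arbitrary units
    toy = [x ** 2 * math.exp(-(x / 0.6) ** 2) for x in grid]  # toy distribution
    for cut in (0.0, 0.4, 0.8):
        print(cut, modified_weight(0.3, grid, toy, w_cut=cut))
```

Raising the cut-off removes nearby sources, so the weight at a fixed lens distance grows, because the remaining sources lie further behind the lens. For a very narrow distribution centred on a single distance the result approaches the simple geometric factor of one minus the ratio of lens to source distance.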

Galaxy redshift distributions $G_{\rm G}$ can be obtained by extrapolating local galaxy
samples to higher redshifts, adopting a constant comoving number density and a
Schechter-type luminosity function. For the present purposes, this is a safe
procedure because the galaxies to be correlated with the QSOs should be at sufficiently
lower redshifts than the QSOs to avoid overlap between the samples. Thus the
extrapolation from the local galaxy population is well justified. In order to convert
galaxy luminosities to observed magnitudes, $k$-corrections need to be taken into
account. Conveniently, the resulting weight functions should be parameterised by
the brightness cut-off of the galaxy sample, in practice by the maximum galaxy
magnitude $m_0$ (i.e. the minimum luminosity) required for a galaxy to enter the
sample. The five representative curves for $G_{\rm G}(w)$ in the lower panel of Fig. 25 are
for $m_0$ increasing from 18.5 to 22.5 (solid curve) in steps of one magnitude.
R-band magnitudes are assumed. For increasing cut-off magnitude $m_0$, i.e. for fainter
galaxy samples, the distributions broaden, as expected. The correlation amplitude
as a function of $m_0$ peaks if $m_0$ is chosen such that the median distance to the
galaxies is roughly half the distance to the bulk of the QSO population considered.

Fig. 25. QSO and galaxy weight functions, $W_{\rm Q}(w)$ and $G_{\rm G}(w)$, respectively. Top
panel: $W_{\rm Q}(w)$ for five different choices of the lower cut-off redshift $z_0$ imposed
on the QSO sample; $z_0$ increases from 0.0 (solid curve) to 2.0 in steps of 0.5. The
peak in $W_{\rm Q}(w)$ shifts to larger distances for increasing $z_0$. Bottom panel:
$G_{\rm G}(w)$ for five different galaxy magnitude limits $m_0$, increasing from 18.5 to 22.5
(solid curve) in steps of one magnitude. The peak in the galaxy distance distribution
shifts towards larger distances with increasing $m_0$, i.e. with decreasing brightness
of the galaxy sample.

7.2.4 Simplifications 

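It turns out in practice that the exact shapes of the QSO and galaxy weight functions
$W_{\rm Q}(w)$ and $G_{\rm G}(w)$ are of minor importance for the results. Allowing
inaccuracies of order 10%, we can replace the functions $G_{\rm Q,G}(w)$ by delta
distributions centred on typical QSO and galaxy distances $w_{\rm Q}$ and
$w_{\rm G} < w_{\rm Q}$. Then, from eq. (7.8),

$$ W_{\rm Q}(w) = \frac{f_K(w_{\rm Q} - w)}{f_K(w_{\rm Q})}\, {\rm H}(w_{\rm Q} - w) , \tag{7.11} $$

where ${\rm H}(x)$ is the Heaviside step function, and the line-of-sight integration in
eq. (7.7) becomes trivial. It is obvious that matter fluctuations at redshifts higher
than the QSO redshift do not contribute to the cross-correlation function
$\xi_{\mu\delta}(\phi)$: Inserting (7.11) together with
$G_{\rm G} = \delta_{\rm D}(w - w_{\rm G})$ into eq. (7.10), we find
$\xi_{\mu\delta}(\phi) = 0$ if $w_{\rm G} > w_{\rm Q}$, as it should be.

The expression for the magnification-density cross-correlation function further
simplifies if we specialise to a model universe with zero spatial curvature, $K = 0$,
such that $f_K(w) = w$. Then,

$$ W_{\rm Q}(w) = \left( 1 - \frac{w}{w_{\rm Q}} \right) {\rm H}(w_{\rm Q} - w) , \tag{7.12} $$

and the cross-correlation function $\xi_{\mu\delta}(\phi)$ reduces to

$$ \xi_{\mu\delta}(\phi) = \frac{3 H_0^2 \Omega_0}{c^2}
   \left( 1 - \frac{w_{\rm G}}{w_{\rm Q}} \right) \frac{w_{\rm G}}{a(w_{\rm G})}
   \int_0^\infty \frac{k\,dk}{2\pi}\, P_\delta(k, w_{\rm G})\,
   {\rm J}_0(w_{\rm G}\, k\, \phi) \tag{7.13} $$

for $w_{\rm Q} > w_{\rm G}$, and $\xi_{\mu\delta}(\phi) = 0$ otherwise.
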
7.3 Theoretical Expectations 
7.3.1 Qualitative Behaviour 

Before we evaluate the magnification-density cross-correlation function fully
numerically, we can gain some insight into its expected behaviour by inserting the
CDM and HDM model spectra defined in eq. (6.36, page 131) into eq. (7.10) and
expanding the result into a power series in $\phi$ (Bartelmann 1995b). As in the case
of the magnification auto-correlation function before, the two model spectra
produce qualitatively different results. To first order in $\phi$,
$\xi_{\mu\delta}(\phi)$ decreases linearly with increasing $\phi$ for CDM, while it is flat
for HDM. The reason for this different appearance is the lack of small-scale power in
HDM, and the abundance thereof in CDM. The two curves shown in Fig. 26 illustrate
this for an Einstein-de Sitter universe with Hubble constant $h = 0.5$. The underlying
density-perturbation power spectra were normalised by the local abundance of rich
clusters, and linear density evolution was assumed.

Fig. 26. Cross-correlation functions between magnification and density contrast,
$\xi_{\mu\delta}(\phi)$, are shown for an Einstein-de Sitter universe with $h = 0.5$,
adopting CDM (solid curve) and HDM (dotted curve) density fluctuation spectra. Both
spectra are normalised to the local cluster abundance, and linear density evolution is
assumed. The lower cut-off redshift of the QSOs is $z_0 = 0.3$, the galaxy magnitude
limit is $m_0 = 20.5$. In agreement with the expectation derived from the CDM and HDM
model spectra (6.36, page 131), the CDM cross-correlation function decreases linearly
with increasing $\phi$ for small $\phi$, while it is flat to first order in $\phi$ for
HDM. The small-scale matter fluctuations in CDM compared to HDM cause
$\xi_{\mu\delta}(\phi)$ to increase more steeply as $\phi \to 0$.

The linear correlation amplitude, $\xi_{\mu\delta}(0)$, for CDM is of order
$3 \times 10^{-3}$, and about a factor of five smaller for HDM. The
magnification-density cross-correlation function for CDM drops to half its peak value
within a few times 10 arc minutes. This, and the monotonic increase of
$\xi_{\mu\delta}$ towards small $\phi$, indicate that density perturbations on angular
scales below 10' contribute predominantly to $\xi_{\mu\delta}$. At typical lens
redshifts, such angular scales correspond to physical scales up to a few Mpc.
Evidently therefore, the non-linear evolution of the density perturbations needs to be
taken into account, and its effect is expected to be substantial.

7.3.2 Results

Figure 27 confirms this expectation; it shows magnification-density
cross-correlation functions for the four cosmological models detailed in Tab. 1 on
page 117. Two curves are shown for each model, one for linear and the other for
non-linear density evolution. The two curves of each pair are easily distinguished
because non-linear evolution increases the cross-correlation amplitude at small $\phi$
by about an order of magnitude above linear evolution, quite independent of the
cosmological model. At the same time, the angular cross-correlation scale is reduced
to a few arc minutes. At angular scales < 30', the non-linear cross-correlation
functions are above the linear results, falling below at larger scales. The correlation
functions for the three cluster-normalised models (SCDM, OCDM and $\Lambda$CDM;
see Tab. 1 on page 117) are very similar in shape and amplitude. The curve for
the $\sigma$CDM model lies above the other curves by a factor of about five, but for
low-density universes, the influence of different power-spectrum normalisations is
much less prominent.

The main results to be extracted from Fig. 27 are that the amplitude of the
magnification-density cross-correlation function, $\xi_{\mu\delta}(0)$, reaches
approximately $5 \times 10^{-2}$, and that $\xi_{\mu\delta}$ drops by an order of magnitude
within about 20'. This behaviour is quite independent of the cosmological parameters
if the density-fluctuation power spectrum is normalised by the local abundance of rich
galaxy clusters. More detailed results can be found in Dolag & Bartelmann (1997) and
Sanz et al. (1997).

7.3.3 Signal-to-Noise Estimate 

The QSO-galaxy correlation function $\xi_{\rm QG}(\phi)$ is larger than
$\xi_{\mu\delta}(\phi)$ by the factor $(\alpha - 1)\,b$. The value of the bias factor $b$
is yet unclear, but it appears reasonable to assume that it is between 1 and 2. For
optically selected QSOs, $\alpha \sim 2.5$, so that $(\alpha - 1)\,b \sim 2 - 3$. Combining
this with the correlation amplitude for CDM read off from Fig. 27, we can expect
$\xi_{\rm QG}(0) \gtrsim 0.1$.

Given the meaning of $\xi_{\rm QG}(\phi)$, the probability to find a foreground galaxy
close to a background QSO is increased by a factor of
$[1 + \xi_{\rm QG}(\phi)] \gtrsim 1.1$ above random. In a small solid angle $d^2\omega$
around a randomly selected background QSO, we thus expect to find

$$ N_{\rm G} \approx [1 + \xi_{\rm QG}(0)]\, \langle n_{\rm G}\rangle\, d^2\omega
   = [1 + \xi_{\rm QG}(0)]\, \langle N_{\rm G}\rangle \tag{7.14} $$

galaxies, where $\langle N_{\rm G}\rangle$ is the average number of galaxies within a solid
angle of $d^2\omega$. In a sample of $N_{\rm Q}$ fields around randomly selected QSOs, the
signal-to-noise ratio for the detection of a galaxy overdensity is then

$$ \frac{S}{N} \approx \frac{N_{\rm Q}\,(N_{\rm G} - \langle N_{\rm G}\rangle)}
   {(N_{\rm Q} \langle N_{\rm G}\rangle)^{1/2}}
   = (N_{\rm Q} \langle N_{\rm G}\rangle)^{1/2}\, \xi_{\rm QG}(0) . \tag{7.15} $$

Fig. 27. Angular magnification-density cross-correlation functions
$\xi_{\mu\delta}(\phi)$ are shown for the four cosmological models specified in Table 1 on
page 117. Two curves are shown for each cosmological model; those with the higher
(lower) amplitude at $\phi = 0$ were calculated with the non-linearly (linearly) evolving
density-perturbation power spectra, respectively. The models are: SCDM (solid curves),
$\sigma$CDM (dotted curves), OCDM (short-dashed curves), and $\Lambda$CDM (long-dashed
curves). Obviously, non-linear evolution has a substantial effect. It increases the
correlation amplitude by about an order of magnitude. The Einstein-de Sitter model
normalised to $\sigma_8 = 1$ has a significantly larger cross-correlation amplitude than
the cluster-normalised Einstein-de Sitter model. For the low-density models, the
difference is much smaller. The curves for the cluster-normalised models are very
similar, quite independent of cosmological parameters.


Typical surface number densities of reasonably bright galaxies are of order
$n_{\rm G} \sim 10$ per square arc minute. Therefore, there should be of order
$\langle N_{\rm G}\rangle \sim 30$ galaxies within a randomly selected disk of one arc
minute radius, in which the QSO-galaxy cross correlation is sufficiently strong. If we
require a certain minimum signal-to-noise ratio such that $S/N \ge (S/N)_0$, the number
of QSO fields to be observed in order to meet this criterion is

$$ N_{\rm Q} \ge \left( \frac{S}{N} \right)_0^2 \xi_{\rm QG}^{-2}(0)\,
   \langle N_{\rm G}\rangle^{-1}
   = 20 \left[ \frac{(S/N)_0}{5} \right]^2
   \left[ \frac{(\alpha - 1)\,b}{4} \right]^{-2}
   \left[ \frac{\xi_{\mu\delta}(0)}{0.05} \right]^{-2}
   \left[ \frac{\langle N_{\rm G}\rangle}{30} \right]^{-1} , \tag{7.16} $$

where we have inserted typical numbers in the last step. This estimate
demonstrates that gravitational lensing by non-linearly evolving large-scale structures
in cluster-normalised CDM can produce correlations between background QSOs and
foreground galaxies at the $5\sigma$ level on arc minute scales in samples of
$\gtrsim 20$ QSOs. The angular scale of the correlations is expected to be of order 1 to
10 arc minutes. Equation (7.16) makes it explicit that more QSO fields need to be
observed in order to establish the significance of the QSO-galaxy correlations if (i)
the QSO number count function is shallow ($\alpha$ close to unity), and (ii) the galaxy
bias factor $b$ is small. In particular, no correlations are expected if $\alpha = 1$,
because then the dilution of the sources and the increase in QSO number exactly cancel.
Numerical simulations (Bartelmann 1995b) confirm the estimate (7.16).
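
The arithmetic behind this estimate is easy to reproduce. The following sketch simply inverts the signal-to-noise relation (7.15) for the number of QSO fields; the function names are hypothetical, and the input values are the typical numbers quoted in the text (a galaxy density of about 10 per square arc minute, a disk of one arc minute radius, and a 5 sigma detection), together with two illustrative correlation amplitudes.

```python
import math


def galaxies_in_disk(surface_density, radius):
    """<N_G> in a disk: surface density (per arcmin^2) times pi r^2 (r in arcmin)."""
    return surface_density * math.pi * radius ** 2


def signal_to_noise(n_qso, mean_n_gal, xi_qg):
    """Eq. (7.15): S/N = sqrt(N_Q <N_G>) * xi_QG(0)."""
    return math.sqrt(n_qso * mean_n_gal) * xi_qg


def required_qso_fields(sn_min, mean_n_gal, xi_qg):
    """Smallest N_Q (not rounded) for which eq. (7.15) reaches sn_min."""
    return sn_min ** 2 / (xi_qg ** 2 * mean_n_gal)


if __name__ == "__main__":
    print("galaxies per disk:", galaxies_in_disk(10.0, 1.0))
    for xi_qg in (0.1, 0.2):  # illustrative amplitudes
        n_q = required_qso_fields(5.0, 30.0, xi_qg)
        print(f"xi_QG(0) = {xi_qg}: N_Q >= {n_q:.1f}")
```

Because the required number of fields scales with the inverse square of the correlation amplitude, halving the amplitude quadruples the sample size needed for the same significance.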

Fugmann's (1990) observation was also tested in a numerical model universe based on the
adhesion approximation to structure formation (Bartelmann & Schneider 1992). This
model universe was populated with QSOs and galaxies, and QSO-galaxy correlations
on angular scales on the order of ~ 10' were investigated using Spearman's
rank-order correlation test (Bartelmann & Schneider 1993a). Light propagation in
the model universe was described with the multiple lens-plane approximation of
gravitational lensing. In agreement with the analytical estimate presented above, it
was found that lensing by large-scale structures can indeed account for the observed
correlations between high-redshift QSOs and low-redshift galaxies, provided the
QSO number-count function is steep. Lensing by individual galaxies was confirmed
to be entirely negligible.

7.3.4 Multiple-Waveband Magnification Bias

The magnification bias quantified by the number-count slope $\alpha$ can be substantially
increased if QSOs are selected in two or more mutually uncorrelated wave bands
rather than one (Borgeest et al. 1991). To see why, suppose that optically bright
and radio-loud QSOs were selected, and that their fluxes in the two wave bands are
uncorrelated. Let $S_{1,2}$ be the flux thresholds in the optical and in the radio regimes,
respectively, and $n_{1,2}$ the corresponding number densities of either optically bright
or radio-loud QSOs on the sky. As in the introduction, we assume that $n_{1,2}$ can be
written as power laws in $S_{1,2}$, with exponents $\alpha_{1,2}$.

In a small solid angle $d^2\omega$, the probability to find an optically bright or
radio-loud QSO is then $p_i(S_i) = n_i(S_i)\, d^2\omega$, and the joint probability to
find an optically bright and radio-loud QSO is the product of the individual
probabilities, or

$$ p(S_1, S_2) = p_1(S_1)\, p_2(S_2) = [n_1(S_1)\, n_2(S_2)]\, d^2\omega
   \propto S_1^{-\alpha_1} S_2^{-\alpha_2}\, d^2\omega , \tag{7.17} $$

provided there is no correlation between the fluxes $S_{1,2}$ so that the two
probabilities are independent. Suppose now that lensing produces a magnification factor
$\mu$ across $d^2\omega$. The joint probability is then changed to

$$ p'(S_1, S_2) \propto \left( \frac{\mu}{S_1} \right)^{\alpha_1}
   \left( \frac{\mu}{S_2} \right)^{\alpha_2} \frac{d^2\omega}{\mu}
   = \mu^{\alpha_1 + \alpha_2 - 1}\, p(S_1, S_2) . \tag{7.18} $$

Therefore, the magnification bias in the optically bright and radio-loud QSO sample
is as efficient as if the number-count function had a slope of
$\alpha = \alpha_1 + \alpha_2$.

More generally, the effective number-count slope for the magnification bias in a
QSO sample that is flux limited in $m$ mutually uncorrelated wave bands is

$$ \alpha = \sum_{i=1}^{m} \alpha_i , \tag{7.19} $$

where $\alpha_i$ are the number-count slopes in the individual wave bands. Then, the
QSO-galaxy cross-correlation function is

$$ \xi_{\rm QG}(\phi) = \left( \sum_{i=1}^{m} \alpha_i - 1 \right) b\,
   \xi_{\mu\delta}(\phi) , \tag{7.20} $$

and can therefore be noticeably larger than for a QSO sample which is flux limited
in one wave band only.
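
As a quick numerical illustration of the multiple-waveband effect, the sketch below evaluates the effective slope and the resulting correlation amplitude for a single-band and a two-band selection. The function names are hypothetical, the radio slope of 1.5 is an assumed placeholder rather than a measured value, and the bias factor and magnification-density amplitude are illustrative numbers of the order discussed in Sect. 7.3.

```python
def effective_slope(slopes):
    """Effective number-count slope: sum of the slopes of uncorrelated bands."""
    return sum(slopes)


def joint_bias_factor(mu, slopes):
    """Change of the joint number density under magnification mu."""
    return mu ** (effective_slope(slopes) - 1.0)


def qso_galaxy_amplitude(slopes, bias, xi_mu_delta):
    """QSO-galaxy correlation: (sum of slopes - 1) * b * xi_mu_delta."""
    return (effective_slope(slopes) - 1.0) * bias * xi_mu_delta


if __name__ == "__main__":
    optical_only = [2.5]
    optical_and_radio = [2.5, 1.5]  # radio slope is an assumed placeholder
    for slopes in (optical_only, optical_and_radio):
        print(slopes, effective_slope(slopes),
              qso_galaxy_amplitude(slopes, bias=1.5, xi_mu_delta=0.05))
```

With these illustrative inputs the two-band selection doubles the expected correlation amplitude, which would reduce the number of QSO fields needed for a given significance by a factor of four.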

7.4 Observational Results 

After this theoretical investigation, we turn to observations of QSO-galaxy
cross-correlations on large angular scales. The existence of QSO-galaxy correlations
was tested and verified in several studies using very different QSO and galaxy
samples.

Bartelmann & Schneider (1993b) repeated Fugmann's analysis with a well-defined sample
of background QSOs, namely the optically identified QSOs from the 1-Jansky catalogue
(Kühr et al. 1981; Stickel et al. 1993; Stickel & Kühr 1993). Optically identified
QSOs with measured redshifts need to be bright enough for detection and
spectroscopy, hence the chosen sample is implicitly also constrained by an optical flux
limit. Optical and radio QSO fluxes are generally not strongly correlated, so that
the sample is affected by a double-waveband magnification bias, which can further
be strengthened by explicitly imposing an optical flux (or magnitude) limit.

Although detailed results differ from Fugmann's, the presence of the correlation is 
confirmed at the 98% confidence level for QSOs with redshifts > 0.75 and brighter 
than 18th magnitude. The number of QSOs matching these criteria is 56. The cor- 
relation significance decreases both for lower- and higher-redshift QSO samples, 
and also for optically fainter ones. This is in accordance with an explanation in 
terms of a (double-waveband) magnification bias due to gravitational lensing. For 
low-redshift QSOs, lensing is not efficient enough to produce the correlations. For 
high-redshift QSOs, the most efficient lenses are at higher redshifts than the galax- 
ies, so that the observed galaxies are uncorrelated with the structures which mag- 
nify the QSOs. Hence, the correlation is expected to disappear for increasing QSO 
redshifts. For an optically unconstrained QSO sample, the effective slope of the 
number-count function is smaller, reducing the strength of the magnification bias 
and therefore also the significance of the correlation. 

With a similar correlation technique, correlations between the 1-Jansky QSO sample
and IRAS galaxies (Bartelmann & Schneider 1994) and diffuse X-ray emission
(Bartelmann et al. 1994) were investigated, leading to qualitatively similar results.
IRAS galaxies are correlated with optically bright, high-redshift ($z > 1.5$) 1-Jansky
sources at the 99.8% confidence level. The higher QSO redshift for which the
correlation becomes significant can be understood if the IRAS galaxy sample is deeper
than the Lick galaxy sample, so that the structures responsible for the lensing can
be traced to higher redshift.

Bartsch et al. (1997) re-analysed the correlation between IRAS galaxies and 1-Jansky
QSOs using a more advanced statistical technique which can be optimised to the
correlation function expected from lensing by large-scale structures. In agreement with
Bartelmann & Schneider (1994), they found significant correlations between the
QSOs and the IRAS galaxies on angular scales of ~ 5', but the correlation
amplitude is higher than expected from large-scale structure lensing, assuming
linear evolution of the density-perturbation power spectrum. Including non-linear
evolution, however, the results by Bartsch et al. (1997) can well be reproduced
(Dolag & Bartelmann 1997).

X-ray photons from the ROSAT All-Sky Survey (e.g. Voges 1992) are correlated 
with optically bright 1-Jansky sources both at low (0.5 <z< 1.0) and at high red- 
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shifts (1.5 < z < 2.0), but there is no significant correlation with QSOs in the inter- 
mediate redshift regime. A plausible explanation for this is that the correlation of 
X-ray photons with low-redshift 1-Jansky QSOs is due to hot gas which is phys- 
ically associated with the QSOs, e.g. which resides in the host clusters of these 
QSOs. Increasing the source redshift, the flux from these clusters falls below the 
detection threshold of the All-Sky Survey, hence the correlation disappears. Upon 
further increasing the QSO redshift, lensing by large-scale structures becomes effi- 
cient, and the X-ray photons trace hot gas in the lenses. 
Rodrigues-Williams & Hogan (1994) 

found a highly significant correlation between optically-selected, high-redshift 
QSOs and Zwicky clusters. Their cluster sample was fairly bright, which indicates 
that the clusters are in the foreground of the QSOs. This rules out that the clusters 
are physically associated with the QSOs and thus exert environmental effects on 
them which might lead to the observed association. Rodrigues-Williams & Hogan 
discussed lensing as the most probable reason for the correlations, although 
simple mass models for the clusters yield lower magnifications than required 
to explain the significance of the effect. Seitz & Schneider (1995b) repeated 
their analysis with the 1-Jansky sample of QSOs. They found agreement with 
Rodrigues-Williams & Hogan's result for intermediate-redshift (z ~ 1) QSOs, but 
failed to detect significant correlations for higher-redshift sources. In addition, a 
significant under-density of low-redshift QSOs close to Zwicky clusters was found, 
for which environmental effects like dust absorption are the most likely explanation.
A variability-selected QSO sample was correlated with Zwicky clusters by
Rodrigues-Williams & Hawkins (1995). They detected a significant correlation between
QSOs with 0.4 < z < 2.2 and foreground Zwicky clusters (with $\langle z\rangle \sim 0.15$)
and interpreted it in terms of gravitational lensing. Again, the implied average QSO
magnification is substantially larger than that inferred from simple lens models for
clusters with velocity dispersions of $\sim 10^3\,\mathrm{km\,s^{-1}}$. Wu & Han (1995) searched for
associations between distant 1-Jansky and 2-Jansky QSOs and foreground Abell 
clusters. They found no correlations with the 1-Jansky sources, and a marginally 
significant correlation with 2-Jansky sources. They argue that lensing by individual 
clusters is insufficient if cluster velocity dispersions are of order $10^3\,\mathrm{km\,s^{-1}}$, and
that lensing by large-scale structures provides a viable explanation. 

Benítez & Martínez-González (1995) found an excess of red galaxies from the APM
catalog around moderate-redshift ($z \sim 1$) 1-Jansky QSOs on angular scales < 5' at the
99.1% significance level. Their colour selection ensures that the galaxies are most
likely at redshifts 0.2 < z < 0.4, well in the foreground of the QSOs. The amplitude
and angular scale of the excess are compatible with its originating from lensing by
large-scale structures. The measurements by Benítez & Martínez-González (1995) are
plotted together with various theoretical QSO-galaxy cross-correlation functions in
Fig. 28, which shows that the measurements agree quite well with the cross-correlation
functions $\xi_{QG}(\phi)$, but fall above the range of theoretical predictions at small
angular scales, $\phi < 2'$. This can be attributed to the magnification bias due to
gravitational lensing by individual clusters. Being based on the weak-lensing
approximation, our approach breaks down when the magnification becomes comparable to
unity, $\mu > 1.5$, say. This amount of magnification occurs for QSOs closer than ~3
Einstein radii to cluster cores. Depending on cosmological parameters and on QSO and
galaxy redshifts, ~3 Einstein radii correspond to ~1'-2'. Hence, we expect the
theoretical expectations from lensing by large-scale structures alone to fall below
the observations on angular scales $\phi < 1'$-$2'$.

Fig. 28. QSO-galaxy cross-correlation measurements are plotted together with theoretical
cross-correlation functions $\xi_{QG}(\phi)$ for various cosmological models as indicated by
line type. The CDM density-perturbation power spectrum was cluster-normalised, and
non-linear evolution was taken into account. The figure shows that the measurements fall
above the theoretical predictions at small angular scales, $\phi < 2'$. This excess can be
attributed to gravitational lensing by individual galaxy clusters (see the text for more
detail). The theoretical curves depend on the Hubble constant $h$ through the shape parameter
$\Gamma = \Omega_0 h$, which determines the peak location of the power spectrum.
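
The numbers quoted in this argument can be checked with a singular isothermal sphere, which is an assumption made here purely for illustration. For such a lens, the image outside the Einstein radius $\theta_E$ is magnified by $\mu = 1/(1 - \theta_E/\theta)$, and $\theta_E = 4\pi(\sigma_v/c)^2 D_{ds}/D_s$. The velocity dispersion of 1000 km/s and the distance ratio of 0.8 used below are illustrative values, not taken from the analyses cited above.

```python
import math

C_KMS = 299792.458                       # speed of light in km/s
ARCSEC_PER_RAD = 180.0 * 3600.0 / math.pi


def einstein_radius_arcsec(sigma_v_kms, dist_ratio):
    """Einstein radius of a singular isothermal sphere in arc seconds.

    dist_ratio is D_ds / D_s, the lens-source over observer-source distance.
    """
    theta_e_rad = 4.0 * math.pi * (sigma_v_kms / C_KMS) ** 2 * dist_ratio
    return theta_e_rad * ARCSEC_PER_RAD


def sis_magnification(theta, theta_e):
    """Magnification of the outer image at angular radius theta > theta_e."""
    return 1.0 / (1.0 - theta_e / theta)


# Magnification at three Einstein radii from the lens centre.
mu_at_three = sis_magnification(3.0, 1.0)

# Angular size of three Einstein radii for an illustrative cluster (assumed values).
theta_e = einstein_radius_arcsec(sigma_v_kms=1000.0, dist_ratio=0.8)
three_theta_e_arcmin = 3.0 * theta_e / 60.0

print(f"mu(3 theta_E) = {mu_at_three:.2f}")
print(f"3 theta_E     = {three_theta_e_arcmin:.2f} arcmin")
```

Under these assumptions the magnification at three Einstein radii is 1.5, and three Einstein radii span roughly one arc minute, in line with the range quoted in the text.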

Norman & Impey (1999) took wide-field R-band images centred on a subsample of
1-Jansky QSOs with redshifts between 1 and 2. They searched for an excess of galaxies
in the magnitude range 19.5 < R < 21 on angular scales of > 10' around these QSOs and
found a
correlation at the 99% significance level. The redshift distribution of the galaxies is 
likely to peak around z ~ 0.2. The angular cross-correlation function between the 
QSOs and the galaxies agrees well with the theoretical expectations, although the 
error bars are fairly large. 

All these results indicate that there are correlations between background QSOs and 
foreground 'light', with light either in the optical, the infrared, or the (soft) X-ray 
wave bands. The angular scale of the correlations is compatible with that expected 
from lensing by large-scale structures, and the amplitude is either consistent with 
that explanation or somewhat larger. Wu & Fang (1996) discussed whether the au- 
tocorrelation of clusters modelled as singular isothermal spheres can produce suf- 
ficient magnification to explain this result. They found that this is not the case, and 
argued that large-scale structures must contribute substantially. 

If lensing is indeed responsible for the correlations detected, other signatures of 
lensing should be found in the vicinity of distant QSOs. Indeed, Fort et al. (1996) 
searched for the shear induced by weak lensing in the fields of five luminous 
QSOs with $z \approx 1$ and found coherent shear signals in four of them (see also
Schneider et al. 1998b). In addition, they detected galaxy groups in three of their
fields. Earlier, Bonnet et al. (1993) had found evidence for coherent weak shear in
the field of the potentially multiply-imaged QSO 2345+007, which was later identified
with a distant cluster (Mellier et al. 1994; Fischer et al. 1994).

Bower & Small (1997) searched for weak-lensing signals in fields around eight
luminous radio sources
at redshifts ~ 1. They confirmed the coherent shear detected earlier by 
Fort et al. (1996) around one of the sources (3C336 at z = 0.927), but failed to 
find signatures of weak lensing in the combined remaining seven fields. 

A cautionary note was recently added to this discussion by 
Williams & Irwin (1998) and Norman & Williams (1999). Cross-correlating 
LBQS and 1-Jansky quasars with APM galaxies, they claimed significant galaxy 
overdensities around QSOs on angular scales of order one degree. As discussed 
above, lensing by currently favoured models of large-scale structures is not able 
to explain such large correlation scales. Thus, if these results hold up, they would 
provide evidence that there is a fundamental difficulty with the current models of 
large-scale structure formation.

7.5 Outlook



Cross correlations between distant QSOs and foreground galaxies on angular scales 
of about ten arc minutes have been observed, and they can be attributed to the 
magnification bias due to gravitational lensing by large-scale structures. Coherent 
shear patterns have been detected around QSOs which are significantly correlated 
with galaxies. The observations so far are in reasonable agreement with theoretical 
expectations, except for the higher observed signal in the innermost few arc min- 
utes, and the claimed correlation signal on degree scales. While the excess cross- 
correlation on small scales can be understood by the lensing effects of individual 
galaxy clusters, correlations on degree scales pose a severe problem for the lensing 
explanation if they persist, because the lensing-induced cross-correlation quickly 
dies off beyond scales of approximately 10'. 

QSO-galaxy cross-correlations have the substantial advantage over other diagnostics
of weak lensing by large-scale structures that they do not pose any severe
observational problems. In particular, it is not necessary to measure either shapes or
sizes of faint background galaxies accurately, because it is sufficient to detect and 
count comparatively bright foreground galaxies near QSOs. However, such count- 
ing requires homogeneous photometry, which is difficult to achieve in particular on 
photographic plates, and requires careful calibration. 

Since the QSO-galaxy cross-correlation function involves filtering the density- 
perturbation power spectrum with a fairly broad function, the zeroth-order Bessel 
function $J_0(x)$ [cf. eq. (7.10)], these correlations are not well suited for constraining
the power spectrum. If the cluster normalisation is close to the correct one, the
QSO-galaxy cross-correlation function is also fairly insensitive to cosmological pa- 
rameters. 

Rather, QSO-galaxy cross correlations are primarily important for measuring the 
bias parameter b. The rationale of future observations of QSO-galaxy correlations 
should therefore be to accurately measure the correlation amplitude on scales be- 
tween a few and 10 arc minutes. On smaller scales, the influence of individual 
galaxy clusters sets in, and on larger scales, the correlation signal is expected to be 
weak. Once it becomes possible to reliably constrain the density-fluctuation power 
spectrum, such observations can then be used to quantify the bias parameter, and 
thereby provide most valuable information for theories of galaxy formation. A pos- 
sible dependence of the bias parameter on scale and redshift can also be extracted. 

Sufficiently large data fields for this purpose will soon become available, in particular
through wide-field surveys like the 2dF Survey (Colless 1998) and the Sloan
Digital Sky Survey (Gunn & Knapp 1993, Loveday & Pier 1998). It therefore appears
feasible that within a few years weak lensing by large-scale structures will
be able to quantify the relation between the distributions of galaxies and the dark
matter.

8 Galaxy-Galaxy Lensing



8.1 Introduction 



Whereas the weak lensing techniques described in Sect. 5 are adequate to map the 
projected matter distribution of galaxy clusters, individual galaxies are not sufficiently
massive to show up in the distortion of the images of background galaxies.
From the signal-to-noise ratio (4.55, page 75) we see that individual isothermal
haloes with a velocity dispersion in excess of $\sim 600\,\mathrm{km\,s^{-1}}$ can be detected at
a high significance level with the currently achievable number densities of faint
galaxy images. Galaxies have haloes of much lower velocity dispersion: the velocity
dispersion of an $L_*$ elliptical galaxy is $\sim 220\,\mathrm{km\,s^{-1}}$, that of an $L_*$ spiral
$\sim 145\,\mathrm{km\,s^{-1}}$.

However, if one is not interested in the mass properties of individual galaxies, but 
instead in the statistical properties of massive haloes of a population of galaxies, 
the weak lensing effects of several such galaxies can statistically be superposed. 
For example, if one considers $N_\mathrm{f}$ identical foreground galaxies, the signal-to-noise
ratio of the combined weak lensing effect increases as $N_\mathrm{f}^{1/2}$, so that for a typical
velocity dispersion for spiral galaxies of $\sigma_v \sim 160\,\mathrm{km\,s^{-1}}$, a few hundred foreground
galaxies are sufficient to detect the distortion they induce on the background galaxy
images.
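
The estimate of "a few hundred" galaxies can be reproduced with a short calculation. The sketch below assumes that the signal-to-noise ratio of a single isothermal halo scales as $\sigma_v^2$ (because its shear does) and that all other factors are held fixed; the function name `galaxies_needed` is introduced here for illustration only.

```python
def galaxies_needed(sigma_ref_kms, sigma_gal_kms):
    """Number of identical galaxies whose stacked signal-to-noise ratio equals
    that of one halo with velocity dispersion sigma_ref_kms.

    Assumes S/N ~ sigma_v**2 per halo and S/N ~ sqrt(N) for a stack of N haloes.
    """
    return (sigma_ref_kms / sigma_gal_kms) ** 4


# Stack of 160 km/s spirals matching a single 600 km/s halo.
n_spiral = galaxies_needed(600.0, 160.0)
print(round(n_spiral))  # 198
```

Because the required number grows with the fourth power of the velocity-dispersion ratio, it is quite sensitive to the adopted value of $\sigma_v$.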

Of course, detection alone does not yield new insight into the mass properties of
galaxy haloes. A quantitative analysis of the lensing signal must account for the
fact that 'identical' foreground galaxies cannot be observed. Therefore, the mass
properties of galaxies have to be parameterised in order to allow the joint analysis
of the foreground galaxy population. In particular, one is interested in the velocity
dispersion of a typical ($L_*$, say) galaxy. Furthermore, the rotation curves of (spiral)
galaxies which have been observed out to $\sim 30\,h^{-1}\,\mathrm{kpc}$ show no hint of a truncation
of the dark halo out to this distance. Owing to the lack of dynamical tracers, with the 
exception of satellite galaxies (Zaritsky & White 1994), a direct observation of the 
extent of the dark halo towards large radii is not feasible with conventional methods. 
The method described in this section uses the light bundles of background galaxies 
as dynamical tracers, which are available at all distances from the galaxies' centres, 
and are therefore able, at least in principle, to probe the size (or the truncation 
radius) of the haloes. Methods for a quantitative analysis of galaxy haloes will be 
described in Sect. 8.2. 

The first attempt at detecting this galaxy-galaxy lensing effect was reported by 
Tyson et al. (1984), but the use of photographic plates and the relatively poor seeing 
prevented them from observing a galaxy-galaxy lensing signal. The first detection 
was reported by Brainerd et al. (1996), and as will be described in Sect. 8.3, several
further observational results have been derived.

Gravitational light deflection can also be used to study the dark matter haloes of 
galaxies in clusters. The potential influence of the environment on the halo proper- 
ties of galaxies can provide a strong hint on the formation and lifetimes of clusters. 

One might expect that galaxy haloes are tidally stripped in clusters and therefore 
physically smaller than those of field galaxies. In Sect. 8.4, we consider galaxy- 
galaxy lensing in clusters, and report on some first results. 



8.2 The Theory of Galaxy-Galaxy Lensing 



A light bundle from a distant galaxy is affected by the tidal field of many foreground 
galaxies. Therefore, in order to describe the image distortion, the whole population 
of foreground galaxies has to be taken into account. But first we shall consider the 
simple case that the image shape is affected (mainly) by a single foreground galaxy. 
Throughout this section we assume that the shear is weak, so that we can replace 
(4.12, page 61) by 

$$ \epsilon^{(s)} = \epsilon - \gamma . \tag{8.1} $$

Consider an axi-symmetric mass distribution for the foreground galaxy, and background
images at separation $\theta$ from its centre. The expectation value of the image
ellipticity then is the shear at $\theta$, which is oriented tangentially. If $p(\epsilon)$ and
$p^{(s)}(\epsilon^{(s)})$ denote the probability distributions of the image and source
ellipticities, then according to (8.1),

$$ p(\epsilon) = p^{(s)}(\epsilon - \gamma) = p^{(s)}(\epsilon) - \gamma_\alpha \frac{\partial}{\partial \epsilon_\alpha} p^{(s)}(\epsilon) , \tag{8.2} $$

where the second equality applies for $|\gamma| \ll 1$. If $\varphi$ is the angle between the major
axis of the image ellipse and the line connecting source and lens centre, one finds
the probability distribution of $\varphi$ by integrating (8.2) over the modulus of $\epsilon$,

$$ p(\varphi) = \int \mathrm{d}|\epsilon|\, |\epsilon|\, p(\epsilon) = \frac{1}{2\pi} - \gamma_\mathrm{t} \cos(2\varphi) \int \mathrm{d}|\epsilon|\, p^{(s)}(\epsilon) , \tag{8.3} $$

where $\varphi$ ranges within $[0, 2\pi]$. Owing to the symmetry of the problem, we can
restrict $\varphi$ to within $0$ and $\pi/2$, so that the probability distribution becomes

$$ p(\varphi) = \frac{2}{\pi} \left[ 1 - \gamma_\mathrm{t} \left\langle \frac{1}{|\epsilon^{(s)}|} \right\rangle \cos(2\varphi) \right] , \tag{8.4} $$

i.e., the probability distribution is skewed towards values larger than $\pi/4$, showing
preferentially a tangential alignment.
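
The step from (8.3) to (8.4) can be made explicit. The following sketch assumes that the source ellipticity distribution is isotropic, $p^{(s)}(\epsilon) = p^{(s)}(|\epsilon|)$, and normalised to unity over the ellipticity plane; under these assumptions the remaining integral in (8.3) is proportional to the mean inverse source ellipticity.

```latex
% Assumes p^{(s)} depends on |\epsilon| only and \int d^2\epsilon\, p^{(s)} = 1.
\begin{align}
\int_0^\infty \mathrm{d}|\epsilon|\,|\epsilon|\,p^{(s)}(|\epsilon|) &= \frac{1}{2\pi} , \\
\left\langle \frac{1}{|\epsilon^{(s)}|} \right\rangle
  &= \int \mathrm{d}^2\epsilon\,\frac{p^{(s)}(|\epsilon|)}{|\epsilon|}
   = 2\pi \int_0^\infty \mathrm{d}|\epsilon|\,p^{(s)}(|\epsilon|) , \\
p(\varphi) &= \frac{1}{2\pi}\left[1 - \gamma_\mathrm{t}
   \left\langle \frac{1}{|\epsilon^{(s)}|} \right\rangle \cos(2\varphi)\right] ,
   \qquad \varphi \in [0, 2\pi] , \\
\langle\varphi\rangle &= \int_0^{\pi/2} \mathrm{d}\varphi\;\varphi\;
   \frac{2}{\pi}\left[1 - \gamma_\mathrm{t}
   \left\langle \frac{1}{|\epsilon^{(s)}|} \right\rangle \cos(2\varphi)\right]
   = \frac{\pi}{4} + \frac{\gamma_\mathrm{t}}{\pi}
   \left\langle \frac{1}{|\epsilon^{(s)}|} \right\rangle .
\end{align}
```

Folding the four quadrants onto $[0, \pi/2]$ multiplies the third line by four, which gives the prefactor $2/\pi$. The last line shows that the mean angle exceeds $\pi/4$ for positive tangential shear, which is the skewness referred to above.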

Lensing by additional foreground galaxies close to the line-of-sight to the background
galaxy does not substantially change the probability distribution (8.4). First
of all, since we assume weak lensing throughout, the effective shear acting on a 
light bundle can well be approximated by the sum of the shear contributions from 
the individual foreground galaxies. This follows either from the linearity of the 
propagation equation in the mass distribution, or from the lowest-order approxima- 
tion of multiple-deflection gravitational lensing (e.g., Blandford & Narayan 1986; 
Seitz & Schneider 1992). Second, the additional lensing galaxies are placed at ran- 
dom angles around the line-of-sight, so that the expectation value of their com- 
bined shear averages to zero. Whereas they slightly increase the dispersion of the 
observed image ellipticities, this increase is negligible since the dispersion of the 
intrinsic ellipticity distribution is by far the dominant effect. However, if the lens 
galaxy under consideration is part of a galaxy concentration, such as a cluster, the 
surrounding galaxies are not isotropically distributed, and the foregoing argument 
is invalid. We shall consider galaxy-galaxy lensing in clusters in Sect. 8.4, and as- 
sume here that the galaxies are generally isolated. 

For an ensemble of foreground-background pairs of galaxies, the probability distribution
for the angle $\varphi$ simply reads

$$ p(\varphi) = \frac{2}{\pi} \left[ 1 - \langle\gamma_\mathrm{t}\rangle \left\langle \frac{1}{|\epsilon^{(s)}|} \right\rangle \cos(2\varphi) \right] , \tag{8.5} $$

where $\langle\gamma_\mathrm{t}\rangle$ is the mean tangential shear of all pairs considered. The function
$p(\varphi)$ is an observable. A significant deviation from a uniform distribution signals
the presence of galaxy-galaxy lensing. To obtain quantitative information on the galaxy
haloes from the amplitude of the cosine term, one needs to know
$\langle 1/|\epsilon^{(s)}| \rangle$. It can directly be derived from observations because the
weak shear assumed here does not significantly change this average between source and
image ellipticities. One also needs a parameterised relation between observable galaxy
properties and the mean shear $\langle\gamma_\mathrm{t}\rangle$. Although in principle fine binning
in galaxy properties (like colour, redshift, luminosity, morphology) and angular
separation of foreground-background
pairs is possible in order to probe the shear as a function of angular distance from 
a well-defined set of foreground galaxies and thus to obtain its radial mass pro- 
file without any parameterisation, this approach is currently unfeasible owing to 
the relatively small fields across which observations of sufficient image quality are 
available. 
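
How the amplitude of the cosine term can be recovered from a set of measured angles may be illustrated as follows. The sketch assumes the distribution $p(\varphi) = (2/\pi)[1 - a\cos(2\varphi)]$ on $[0, \pi/2]$ with $a = \langle\gamma_\mathrm{t}\rangle\langle 1/|\epsilon^{(s)}|\rangle$, draws mock angles by rejection sampling, and uses the relation $\langle\cos(2\varphi)\rangle = -a/2$ that follows from it. This simple moment estimator is chosen here for brevity and is not the fitting procedure of the analyses discussed below; the amplitude of 0.1 is an arbitrary test value.

```python
import math
import random


def sample_angles(amplitude, n, seed=0):
    """Draw n angles in [0, pi/2] from p(phi) = (2/pi) [1 - a cos(2 phi)]
    by rejection sampling; amplitude is a = <gamma_t> <1/|eps_s|>."""
    rng = random.Random(seed)
    bound = 1.0 + abs(amplitude)          # upper bound of 1 - a cos(2 phi)
    angles = []
    while len(angles) < n:
        phi = rng.uniform(0.0, math.pi / 2.0)
        if rng.random() * bound < 1.0 - amplitude * math.cos(2.0 * phi):
            angles.append(phi)
    return angles


def estimate_amplitude(angles):
    """Moment estimator: for the distribution above, <cos(2 phi)> = -a / 2."""
    mean_cos = sum(math.cos(2.0 * phi) for phi in angles) / len(angles)
    return -2.0 * mean_cos


# Mock sample with an assumed amplitude of 0.1.
angles = sample_angles(amplitude=0.1, n=100_000, seed=42)
a_hat = estimate_amplitude(angles)
mean_phi = sum(angles) / len(angles)

print(f"estimated amplitude: {a_hat:.3f}")
print(f"mean angle minus pi/4: {mean_phi - math.pi / 4.0:.4f}")
```

The statistical error of this estimator is about $\sqrt{2/N}$ for $N$ pairs, so a few thousand pairs are needed before an amplitude of a few per cent stands out from a uniform distribution.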



A convenient parameterisation of the mass profile is the truncated isothermal sphere
with surface mass density

$$ \Sigma(\xi) = \frac{\sigma_v^2}{2 G \xi} \left( 1 - \frac{\xi}{\sqrt{s^2 + \xi^2}} \right) , \tag{8.6} $$

where $s$ is the truncation radius. This is a special case of the mass distribution
(3.20, page 51). Brainerd et al. (1996) showed that this mass profile corresponds to
a physically realisable dark-matter particle distribution.[*] The velocity dispersion
is assumed to scale with luminosity according to (2.68, page 34), which is supported
by observations. A similar scaling of $s$ with luminosity $L$ or velocity dispersion $\sigma_v$
is also assumed,

$$ s = s_* \left( \frac{\sigma_v}{\sigma_{v,*}} \right)^2 = s_* \left( \frac{L}{L_*} \right)^{2/\alpha} , \tag{8.7} $$



where the choice of the exponent is largely arbitrary. The scaling in (8.7) is such that 
the ratio of truncation radius and Einstein radius at fixed redshift is independent of 
$L$. If, in addition, $\alpha = 4$, the total mass-to-light ratio is identical for all galaxies. The
fiducial luminosity $L_*$ may depend on redshift. For instance, if the galaxies evolve
passively, their mass properties are unaffected, but aging of the stellar population
causes them to become fainter with decreasing redshift. This effect may be important
for very deep observations, such as the Hubble Deep Field (Hudson et al. 1998), in 
which the distribution of lens galaxies extends to high redshifts. 
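
The properties of this parameterisation can be explored numerically. The sketch below assumes the truncated isothermal profile in the form $\Sigma(\xi) = \sigma_v^2/(2G\xi)\,[1 - \xi/\sqrt{s^2+\xi^2}]$, for which the projected mass within radius $\xi$ is $(\pi\sigma_v^2/G)\,[\xi + s - \sqrt{s^2+\xi^2}]$ and the total mass is $\pi\sigma_v^2 s/G$. The fiducial values $\sigma_{v,*} = 160$ km/s and $s_* = 50$ kpc are placeholders chosen for illustration, and all function names are introduced here.

```python
import math

G = 4.30091e-6  # gravitational constant in kpc (km/s)^2 / M_sun


def surface_density(xi_kpc, sigma_v, s_kpc):
    """Surface mass density of the truncated isothermal sphere, M_sun / kpc^2."""
    return sigma_v ** 2 / (2.0 * G * xi_kpc) * (1.0 - xi_kpc / math.hypot(s_kpc, xi_kpc))


def projected_mass(xi_kpc, sigma_v, s_kpc):
    """Mass projected within radius xi (analytic integral of the profile)."""
    return math.pi * sigma_v ** 2 / G * (xi_kpc + s_kpc - math.hypot(s_kpc, xi_kpc))


def total_mass(sigma_v, s_kpc):
    """Finite total mass, pi sigma_v^2 s / G, in solar masses."""
    return math.pi * sigma_v ** 2 * s_kpc / G


def truncation_radius(sigma_v, sigma_star, s_star):
    """Scaling s = s_* (sigma_v / sigma_{v,*})^2."""
    return s_star * (sigma_v / sigma_star) ** 2


SIGMA_STAR, S_STAR = 160.0, 50.0   # assumed fiducial values (km/s, kpc)

# Cross-check the analytic projected mass with a midpoint-rule integration.
x_max, n_steps = 100.0, 20000
dx = x_max / n_steps
numeric = sum(
    2.0 * math.pi * xi * surface_density(xi, SIGMA_STAR, S_STAR) * dx
    for xi in ((k + 0.5) * dx for k in range(n_steps))
)
analytic = projected_mass(x_max, SIGMA_STAR, S_STAR)

# Doubling sigma_v quadruples s, so the total mass grows as sigma_v^4.
mass_ratio = (
    total_mass(2.0 * SIGMA_STAR, truncation_radius(2.0 * SIGMA_STAR, SIGMA_STAR, S_STAR))
    / total_mass(SIGMA_STAR, S_STAR)
)
print(f"M(<100 kpc): numeric {numeric:.3e}, analytic {analytic:.3e} M_sun")
print(f"mass ratio for doubled sigma_v: {mass_ratio:.1f}")
```

Since the total mass scales as $\sigma_v^4$ and the luminosity as $\sigma_v^\alpha$, the mass-to-light ratio is constant for $\alpha = 4$, as stated in the text.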

The luminosity L of a lens galaxy can be inferred from the observed flux and an as- 
sumed redshift. Since the scaling relation (2.68) applies to the luminosity measured 
in a particular waveband, the calculation of the luminosity from the apparent mag- 
nitude in a specified filter needs to account for the k-correction. If data are avail- 
able in a single waveband only, an approximate average k-correction relation has 
to be chosen. For multi-colour data, the k-correction can be estimated for individ- 
ual galaxies more reliably. In any case, one assumes a relation between luminosity, 
apparent magnitude, and redshift, 

L = L(m,z). (8.8) 



The final aspect to be discussed here is the redshift of the galaxies. Given that 
a galaxy-galaxy analysis involves at least several hundred foreground galaxies, 
and even more background galaxies, one cannot expect that all of them have 
spectroscopically determined redshifts. In a more favourable situation, multi- 
colour data are given, from which a redshift estimate can be obtained, using the 
photometric redshift method (e.g., Connolly et al. 1995; Gwyn & Hartwick 1996; 
Hogg et al. 1998). These redshift estimates are characteristically accurate to
$\Delta z \sim 0.1$, depending on the photometric accuracy and the number of filter bands in which
photometric data are measured. For a single waveband only, one can still obtain a
redshift estimate, but a rather imprecise one. One then has to use the redshift distribution
of galaxies at that particular magnitude, obtained from spectroscopic or
multi-colour redshift surveys in other fields. Hence, one assumes that the redshift
probability distribution $p_z(z; m)$ as a function of magnitude is known sufficiently
accurately.

[*] The mass profile (8.6) is physically realisable in the sense that there exists an
isotropic, non-negative particle distribution function which gives rise to a spherical
density distribution corresponding to (8.6).

Suppose for a moment that all galaxy redshifts were known. Then, one can predict 
the effective shear for each galaxy, caused by all the other galaxies around it, 

$$ \gamma_i = \sum_j \gamma_{ij}(\theta_i - \theta_j, z_i, z_j, m_j) , \tag{8.9} $$

where $\gamma_{ij}$ is the shear produced by the $j$-th galaxy on the $i$-th galaxy image, which
depends on the angular separation and the mass properties of the $j$-th galaxy. From
its magnitude and redshift, the luminosity can be inferred from (8.8), which fixes
$\sigma_v$ and the halo size $s$ through the scaling relations (2.68) and (8.7). Of course, for
$z_i < z_j$, $\gamma_{ij} = 0$. Although the sum in (8.9) should in principle extend over the whole
sky, the lensing effect of all foreground galaxies with angular separation larger than
some $\theta_\mathrm{max}$ will average to zero. Therefore, the sum can be restricted to separations
$< \theta_\mathrm{max}$. We shall discuss the value of $\theta_\mathrm{max}$ further below.

In the realistic case of unknown redshifts, but known probability distribution
$p_z(z; m)$, the shear $\gamma_i$ cannot be determined. However, by averaging (8.9) over
$p_z(z; m)$, the mean and dispersion, $\langle\gamma_i\rangle$ and $\sigma_{\gamma,i}$, of the shear for the
$i$-th galaxy can be calculated. Instead of performing the high-dimensional integration
explicitly, this averaging can conveniently be done by a Monte-Carlo integration. One
can generate multiple realisations of the redshift distribution by randomly drawing
redshifts from the probability density $p_z(z; m)$. For each realisation, the $\gamma_i$ can be
calculated from (8.9). By averaging over the realisations, the mean $\langle\gamma_i\rangle$ and
dispersion $\sigma_{\gamma,i}$ of $\gamma_i$ can be estimated.
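
The Monte-Carlo averaging described above can be sketched in a few lines. Several simplifications are assumed and should not be mistaken for the procedure of any published analysis: every lens is an untruncated isothermal sphere with the same velocity dispersion (instead of the luminosity scalings (2.68) and (8.7)), distances are computed in an Einstein-de Sitter model, the dispersion is taken as $\sigma_{\gamma,i}^2 = \langle|\gamma_i|^2\rangle - |\langle\gamma_i\rangle|^2$, and `draw_redshift` is a hypothetical stand-in for the distribution $p_z(z; m)$.

```python
import cmath
import math
import random

C_KMS = 299792.458  # speed of light in km/s


def dist_ratio_eds(z_d, z_s):
    """D_ds / D_s in an Einstein-de Sitter universe (toy cosmology);
    zero if the 'lens' is not in front of the 'source'."""
    if z_s <= z_d:
        return 0.0
    a_d = (1.0 + z_d) ** -0.5
    a_s = (1.0 + z_s) ** -0.5
    return (a_d - a_s) / (1.0 - a_s)


def sis_shear(theta_i, theta_j, z_i, z_j, sigma_v_kms):
    """Complex shear gamma_ij on image i from an isothermal lens j.
    Positions are complex numbers (x + i y) in radians."""
    sep = theta_i - theta_j
    theta_e = 4.0 * math.pi * (sigma_v_kms / C_KMS) ** 2 * dist_ratio_eds(z_j, z_i)
    # Tangential shear: the image is stretched perpendicular to sep.
    return -theta_e / (2.0 * abs(sep)) * cmath.exp(2j * cmath.phase(sep))


def draw_redshift(mag, rng):
    """Hypothetical p_z(z; m): log-normal with a median rising with magnitude."""
    median = max(0.05, 0.2 + 0.1 * (mag - 20.0))
    return rng.lognormvariate(math.log(median), 0.3)


def shear_moments(positions, mags, sigma_v_kms, theta_max, n_real=500, seed=1):
    """Mean <gamma_i> and dispersion sigma_{gamma,i} over redshift realisations."""
    rng = random.Random(seed)
    n = len(positions)
    sum_g = [0j] * n
    sum_g2 = [0.0] * n
    for _ in range(n_real):
        z = [draw_redshift(m, rng) for m in mags]      # one redshift realisation
        for i in range(n):
            g = 0j
            for j in range(n):
                if j != i and abs(positions[i] - positions[j]) < theta_max:
                    g += sis_shear(positions[i], positions[j], z[i], z[j], sigma_v_kms)
            sum_g[i] += g
            sum_g2[i] += abs(g) ** 2
    means = [s / n_real for s in sum_g]
    disps = [math.sqrt(max(q / n_real - abs(m) ** 2, 0.0))
             for q, m in zip(sum_g2, means)]
    return means, disps


ARCSEC = math.pi / (180.0 * 3600.0)
positions = [0j, 10.0 * ARCSEC + 0j]   # bright galaxy; faint galaxy 10 arcsec away
mags = [20.0, 24.0]
means, disps = shear_moments(positions, mags, sigma_v_kms=160.0,
                             theta_max=60.0 * ARCSEC)
print(means, disps)
```

In this toy configuration the faint galaxy receives a mean shear with negative real part, corresponding to tangential stretching relative to the bright galaxy, whereas the bright galaxy is almost never behind the faint one and so receives nearly zero mean shear.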

8.3 Results 

The first attempt at detecting galaxy-galaxy lensing was made by
Tyson et al. (1984). They analysed a deep photographic survey consisting of
35 prime-focus plates taken with the 4-meter Mayall Telescope at Kitt Peak. An area of
36 arcmin$^2$ on each plate was digitised. After object detection, ~12,000 'foreground'
and ~47,000 'background' galaxies were selected by their magnitudes,
such that the faintest object in the 'foreground' class was one magnitude brighter
than the brightest 'background' galaxy. This approach assumes that the apparent
magnitude of an object provides a good indication for its redshift, which seems
to be valid, although the redshift distributions of 'foreground' and 'background'
galaxies will substantially overlap. There were ~28,000 foreground-background
pairs with $\Delta\theta < 63''$ in their sample, but no significant tangential alignment
could be measured. By comparing their observational results with Monte-Carlo
simulations, Tyson et al. concluded that the characteristic velocity dispersion of
a foreground galaxy in their sample must be smaller than about $120\,\mathrm{km\,s^{-1}}$. This
limit was later revised upwards to $\sim 230\,\mathrm{km\,s^{-1}}$ by Kovner & Milgrom (1987)
who noted that the assumption made in Tyson et al.'s analysis that all background
galaxies are at infinite distance (i.e., $D_\mathrm{ds}/D_\mathrm{s} = 1$) was critical. This upper limit is
fully compatible with our knowledge of galaxy masses.

This null-detection of galaxy-galaxy lensing in a very large sample of objects ap- 
parently discouraged other attempts for about a decade. After the first weak-lensing 
results on clusters became available, it was obvious that this method requires deep 
data with superb image quality. In particular, the non-linearity of photographic 
plates and mediocre seeing conditions are probably fatal to the detection of this effect,
owing to its smallness. The shear at 5'' from an $L_*$ galaxy with $\sigma_v = 160\,\mathrm{km\,s^{-1}}$
is less than 5%, and pairs with smaller separations are very difficult to investigate 
as the bright galaxy will affect the ellipticity measurement of its close neighbour 
on ground-based images. 
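
The size of this shear can be estimated from the singular isothermal sphere, for which $|\gamma| = \theta_E/(2\theta)$ with $\theta_E = 4\pi(\sigma_v/c)^2 D_{ds}/D_s$. The distance ratio of 0.6 adopted below is an assumed, illustrative value; the result scales linearly with it.

```python
import math

C_KMS = 299792.458                       # speed of light in km/s
ARCSEC_PER_RAD = 180.0 * 3600.0 / math.pi


def sis_shear_magnitude(theta_arcsec, sigma_v_kms, dist_ratio):
    """|gamma| = theta_E / (2 theta) for a singular isothermal sphere."""
    theta_e = 4.0 * math.pi * (sigma_v_kms / C_KMS) ** 2 * dist_ratio * ARCSEC_PER_RAD
    return theta_e / (2.0 * theta_arcsec)


# Shear 5 arcsec from a 160 km/s galaxy, for an assumed D_ds/D_s = 0.6.
shear_at_5 = sis_shear_magnitude(5.0, 160.0, dist_ratio=0.6)
print(f"{shear_at_5:.3f}")  # 0.044
```

A shear of this size is several times smaller than the typical intrinsic ellipticity of a single galaxy, which is why many foreground-background pairs must be combined.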

Using a single $9.6' \times 9.6'$ blank field, with a total exposure time of nearly seven
hours on the 5-meter Hale Telescope on Mount Palomar, Brainerd et al. (1996) 
reported the first detection of galaxy-galaxy lensing. Their co-added image had 
a seeing of $0.87''$ at FWHM, and the 97% completeness limit was r = 26. They
considered 'foreground' galaxies in the magnitude range 20 < r < 23, and several 
fainter bins for defining the 'background' population, and investigated the distribution
function $p(\varphi)$ for pairs with separation $5'' < \Delta\theta < 34''$. The most significant
deviation of $p(\varphi)$ from a flat distribution occurs for 'background' galaxies in the
range 23 < r < 24. For fainter (and thus smaller) galaxies, the accuracy of the shape
determination deteriorates, as Brainerd et al. explicitly show. The number of 'foreground'
galaxies, 'background' galaxies, and pairs, is $N_\mathrm{f} = 439$, $N_\mathrm{b} = 506$, and
$N_\mathrm{pairs} = 3202$. The binned distribution for this 'background' sample is shown in
Fig. 29, together with a fit according to (8.5). A Kolmogorov-Smirnov test rejects
a uniform distribution of $p(\varphi)$ at the 99.9% level, thus providing the first detection
of galaxy-galaxy lensing.

Brainerd et al. performed a large number of tests to check for possible systematic
errors, including
null tests (e.g., replacing the positions of 'foreground' galaxies by random points, or 
stars), splitting the whole sample into various subsamples (e.g., inner part vs. outer 
part of the image, upper half vs. lower half etc.), and these tests were passed satis- 
factorily. Also a slight PSF anisotropy in the data, or contamination of the ellipticity 
measurement of faint galaxies by brighter neighbouring galaxies, cannot explain 
the observed relative alignment, as tested with extensive simulations, so that the 
detection must be considered real. 

Brainerd et al. then quantitatively analysed their observed alignment, using the model
outlined in Sect. 8.2, with $\alpha = 4$. The predictions of the model were inferred from
Monte-Carlo simulations, in which galaxies were randomly distributed with the observed
number density, and redshifts were assigned according to a probability distribution
$p_z(z; m)$, for which they used a slight extrapolation from existing redshift surveys,
together with a simple prescription for the k-correction in (8.8) to assign luminosities
to the galaxies. The ellipticity for each background galaxy image was then obtained
by randomly drawing an intrinsic ellipticity and adding shear according to (8.9). The
simulated probability distribution $p(\varphi)$ was discretised into several bins in angular
separation $\Delta\theta$, and compared to the observed orientation distribution, using
$\chi^2$-minimisation with respect to the model parameters $\sigma_{v,*}$ and $s_*$. The result of this
analysis is shown in Fig. 30. The shape of the $\chi^2$-contours is characteristic in that
they form a valley which is relatively narrow in the $\sigma_{v,*}$-direction, but extends very
far out into the $s_*$-direction. Thus, the velocity dispersion $\sigma_{v,*}$ can significantly be
constrained with these observations, while only a lower limit on $s_*$ can be derived.
Formal 90% confidence limits on $\sigma_{v,*}$ are $\sim 100\,\mathrm{km\,s^{-1}}$ and $\sim 210\,\mathrm{km\,s^{-1}}$, with a
best-fitting value of about $160\,\mathrm{km\,s^{-1}}$, whereas the 1- and 2-$\sigma$ lower limits on $s_*$ are
25 kpc and ~10 kpc, respectively.

Fig. 29. The probability distribution $p(\varphi)$ for the 3202 foreground-background pairs
(20 < r < 23 and 23 < r < 24, respectively) with $5'' < \Delta\theta < 34''$ in the sample used by
Brainerd et al. (1996), together with the best fit according to (8.5). The observed
distribution is incompatible with a flat distribution (dotted line) at a high confidence
level of 99.9% (Fig. 2a of Brainerd et al.).

Finally, Brainerd et al. studied the dependence of the lensing signal $\langle\gamma_\mathrm{t}\rangle$ on the
colour of their 'background' sample, by splitting it into a red and a blue half.
The lensing signal of the former is compatible with zero on all scales, while the
blue sample reveals a strong signal which decreases with angular separation as expected.
This result is in accordance with that discussed in Sect. 5.5.3, where the
blue galaxies showed a stronger lensing signal as well, indicating that their redshift
distribution extends to larger distances.

Fig. 30. Contours of constant $\chi^2$ in the $V_*$-$s_*$ parameter plane, where
$V_* = \sqrt{2}\,\sigma_{v,*}$, obtained from a comparison of the observed tangential alignment
$\langle\gamma_\mathrm{t}\rangle$ with the distribution found in Monte-Carlo simulations. The solid contours
range from 0.8 (innermost) to 8 per degree of freedom; the dotted curve displays
$\chi^2 = 1$ per degree of freedom (Fig. 7 of Brainerd et al.).

We have discussed the work of Brainerd et al. (1996) in some detail since it provided
the first detection of galaxy-galaxy lensing, and since it so far remains
the only one obtained from the ground. Also, their careful analysis exemplifies the
difficulties in deriving a convincing result.

Griffiths et al. (1996) analysed the images from the Hubble Space Telescope Medium
Deep Survey (MDS) in terms of galaxy-galaxy lensing. The MDS is an imaging survey, using
parallel data obtained with the WFPC2 camera on-board HST. They identified
1600 'foreground' (15 < I < 22) and 14000 'background' (22 < I < 26)
galaxies. Owing to the spatial resolution of the HST, a morphological classification
of the foreground galaxies could be performed, and spiral and elliptical
galaxies could separately be analysed. They considered the mean orientation angle
$\langle\varphi\rangle = \pi/4 + \pi^{-1} \langle\gamma_\mathrm{t}\rangle \langle 1/|\epsilon^{(s)}| \rangle$ as a statistical variable,
and scaled the truncation radius in their mass models in proportion to the half-light
radius. They found that $\sigma_{v,*} = 220\,\mathrm{km\,s^{-1}}$ and $\sigma_{v,*} = 160\,\mathrm{km\,s^{-1}}$ are compatible
with their shear data for elliptical and spiral galaxies, respectively. For their sample
of elliptical foreground galaxies, they claim that the truncation radius must be more
than ten times the half-light radius to fit their data, and that a de Vaucouleurs mass
profile is excluded. Unfortunately, no significance levels are quoted.



A variant of the method for a quantitative analysis of galaxy-galaxy lensing was 
developed by Schneider & Rix (1997). Instead of a $\chi^2$-analysis of $\langle\gamma_\mathrm{t}\rangle$ in angular
separation bins, they suggested a maximum-likelihood analysis, using the individual
galaxy images. In their Monte-Carlo approach, the galaxy positions (and magnitudes)
are kept fixed, and only the redshifts of the galaxies are drawn from their
respective probability distribution $p_z(z; m)$, as described at the end of Sect. 8.2. The
resulting log-likelihood function

$$ \ell = - \sum_i \frac{|\epsilon_i - \langle\gamma_i\rangle|^2}{\rho^2 + \sigma_{\gamma,i}^2} - \sum_i \ln\left[ \pi \left( \rho^2 + \sigma_{\gamma,i}^2 \right) \right] , \tag{8.10} $$

where $\rho$ is the dispersion of the intrinsic ellipticity distribution, here assumed to be a
Gaussian, can then be maximised with respect to the model parameters, e.g., $\sigma_{v,*}$
and $s_*$. Extensive simulations demonstrated that this approach, which utilises all
of the information provided by observations, yields an unbiased estimate of these 
model parameters. Later, Erben (1997) showed that this remains valid even if the 
lens galaxies have elliptical projected mass profiles. 
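
A log-likelihood of this kind is straightforward to evaluate once the mean shear and its dispersion are available for every image. The sketch below assumes the Gaussian form $\ell = -\sum_i |\epsilon_i - \langle\gamma_i\rangle|^2/(\rho^2 + \sigma_{\gamma,i}^2) - \sum_i \ln[\pi(\rho^2 + \sigma_{\gamma,i}^2)]$ with complex ellipticities; the function name and the numerical values are illustrative and not taken from the cited work.

```python
import math


def log_likelihood(eps, mean_gamma, sigma_gamma, rho):
    """Gaussian log-likelihood of observed complex image ellipticities.

    eps         -- observed ellipticities epsilon_i (complex)
    mean_gamma  -- model mean shears <gamma_i> (complex)
    sigma_gamma -- model shear dispersions sigma_{gamma,i} (real)
    rho         -- dispersion of the intrinsic ellipticity distribution
    """
    total = 0.0
    for e, g, s in zip(eps, mean_gamma, sigma_gamma):
        var = rho ** 2 + s ** 2            # intrinsic plus redshift-induced scatter
        total -= abs(e - g) ** 2 / var + math.log(math.pi * var)
    return total


# Illustrative values: two images whose ellipticities equal the 'true' shear.
eps = [0.03 + 0.01j, -0.02 + 0.04j]
true_shear = [0.03 + 0.01j, -0.02 + 0.04j]
no_shear = [0j, 0j]
sigmas = [0.005, 0.005]

ll_true = log_likelihood(eps, true_shear, sigmas, rho=0.3)
ll_null = log_likelihood(eps, no_shear, sigmas, rho=0.3)
print(ll_true > ll_null)  # True
```

In an actual analysis the mean shears and dispersions would be recomputed for each trial set of model parameters, and the likelihood maximised over those parameters.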

This method was applied to the deep multi-colour imaging data of the 
Hubble Deep Field (HDF; Williams et al. 1996) by Hudson et al. (1998), after 
Dell'Antonio & Tyson (1996) detected a galaxy-galaxy lensing signal in the HDF
on an angular scale of $< 5''$. The availability of data in four wavebands allows
an estimate of photometric redshifts, a method demonstrated to be quite reliable
by spectroscopy of HDF galaxies (e.g., Hogg et al. 1998). The accurate redshift
estimates, and the depth of the HDF, compensate for the small field-of-view of
$\sim 5\,\mathrm{arcmin}^2$. A similar study of the HDF data was carried out by the Caltech group
(see Blandford et al. 1998). 

In order to avoid k-corrections, using the multi-colour photometric data to relate all
magnitudes to the rest-frame B-band, Hudson et al. considered lens galaxies with
redshift $z < 0.85$ only, leaving 208 galaxies. Only such source-lens pairs for which
the estimated redshifts differ by at least 0.5 were included in the analysis, giving
about $10^4$ foreground-background pairs. They adopted the same parameterisation
for the lens population as described in Sect. 8.2, except that the depth of the HDF
suggests that the fiducial luminosity should be allowed to depend on redshift,
$L_* \propto (1+z)^\zeta$. Assuming no evolution, $\zeta = 0$, and a Tully-Fisher index of $1/\alpha = 0.35$,
they found $\sigma_{v,*} = (160 \pm 30)\,\mathrm{km\,s^{-1}}$. Various control tests were performed
to demonstrate the robustness of this result, and potential systematic effects were
shown to be negligible.

As in the previous studies, halo sizes could not be significantly constrained. The 
lensing signal is dominated by spiral galaxies at a redshift of z ~ 0.6. Comparing 
the Tully-Fisher relation at this redshift to the local relation, the lensing results
indicate that intermediate-redshift galaxies are fainter than local spirals by $1 \pm 0.6$
magnitudes in the B-band, at fixed circular velocity.

Hence, all results reported so far yield compatible values of $\sigma_{v,*}$, but do not allow
upper bounds on the halo size to be set. The flatness of the likelihood surface in
the $s_*$-direction shows that a measurement of $s_*$ requires much larger samples than
used before. We can understand the insensitivity to $s_*$ in the published analyses
at least qualitatively. The shear caused by a galaxy at a distance of, say, 100 kpc
is very small, of order 1%. This implies that the difference in shear caused by
galaxies with truncation radii of $s = 20$ kpc and $s = 100$ kpc is very small indeed. In
addition, there are typically other galaxies closer to the line-of-sight to background
galaxies which produce a larger shear, making it more difficult to probe the shear
of widely separated foreground galaxies. Hence, to probe the halo size, many more
foreground-background pairs must be considered. In addition, the angular scale
$\theta_\mathrm{max}$ within which pairs are considered needs to be larger than the angular scale of
the truncation radius at typical redshifts of the galaxies, and on the other hand, $\theta_\mathrm{max}$
should be much smaller than the size of the data field available. Hence, to probe
large scales of the halo, wide-field imaging data are needed.
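
The order of magnitude of the shear at 100 kpc can be checked with a few lines of Python. The sketch below is only a rough estimate under stated assumptions: it uses an untruncated singular isothermal sphere, for which $\gamma(\theta) = \theta_\mathrm{E}/(2\theta)$ with $\theta_\mathrm{E} = 4\pi(\sigma_v/c)^2 D_\mathrm{ds}/D_\mathrm{s}$, and it adopts hypothetical round numbers for the distances (an angular-diameter distance of 1000 Mpc to the lens and $D_\mathrm{ds}/D_\mathrm{s} = 0.5$). The function name `sis_shear` is likewise an assumption made for this illustration.

```python
import math

C_KMS = 299792.458  # speed of light in km/s

def sis_shear(sigma_v_kms, xi_kpc, d_lens_mpc, dds_over_ds):
    """Shear of a singular isothermal sphere at projected separation xi_kpc.

    gamma = theta_E / (2 theta), with theta_E = 4 pi (sigma_v/c)^2 D_ds/D_s
    and theta = xi / D_d in the small-angle approximation.
    """
    theta_e = 4.0 * math.pi * (sigma_v_kms / C_KMS) ** 2 * dds_over_ds  # radians
    theta = xi_kpc / (d_lens_mpc * 1.0e3)                               # radians
    return theta_e / (2.0 * theta)

# Hypothetical lens: sigma_v = 160 km/s, D_d = 1000 Mpc, D_ds/D_s = 0.5.
gamma_100 = sis_shear(160.0, 100.0, 1000.0, 0.5)
gamma_20 = sis_shear(160.0, 20.0, 1000.0, 0.5)
print(f"shear at 100 kpc: {gamma_100:.4f}")
print(f"shear at  20 kpc: {gamma_20:.4f}")
```

With these inputs the shear at 100 kpc comes out just below one per cent, which is far smaller than the intrinsic ellipticity dispersion of a single background galaxy. This is why very many foreground-background pairs are needed before the outer halo profile can be constrained.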

There is a related problem which needs to be understood in greater detail. Since
galaxies are clustered, and probably (biased) tracers of an underlying dark matter
distribution (e.g., most galaxies may live in groups), it is not evident whether the
shear caused by a galaxy at a spatial separation of, say, 100 kpc is caused mainly by
the dark matter halo of the galaxy itself, or rather by the dark-matter halo associated
with the group. Here, numerical simulations of the dark matter may indicate to
which degree these two effects can be separated, and observational strategies for
this need to be developed.



8.4 Galaxy-Galaxy Lensing in Galaxy Clusters 

An interesting extension of the work described above aims at the investigation of 
the dark-matter halo properties of galaxies within galaxy clusters. In the hierarchi- 
cal model for structure formation, clusters grow by mergers of less massive haloes, 
which by themselves formed by merging of even smaller substructures. Tidal forces 
in clusters, possible ram-pressure stripping by the intra-cluster medium, and close 
encounters during the formation process, may affect the haloes of galaxies, most 
of which presumably formed at an early epoch. Therefore, it is unclear at present 
whether the halo properties of galaxies in clusters are similar to those of field galax- 
ies. 

Galaxy-galaxy lensing offers an exciting opportunity to probe the dark galaxy 
haloes in clusters. There are several differences between the investigation of field 
and of cluster galaxies. First, the number of massive galaxies in a cluster is fairly
small, so the statistics for a single cluster will be limited. This can be compensated
by investigating several clusters simultaneously. Second, the image distortion is
determined by the reduced shear, $g = \gamma/(1-\kappa)$. For field galaxies, where the shear
and the surface mass density are small, one can set $g \approx \gamma$, but this approximation
no longer holds for galaxies in clusters, where the cluster provides $\kappa$ substantially
above zero. This implies that one needs to know the mass distribution of the cluster
before the statistical properties of the massive galaxy haloes can be investigated.
On the other hand, the cluster convergence magnifies the lensing signal from the
galaxies, so that fewer cluster galaxies are needed to derive significant lensing results
compared to field galaxies of similar mass. Third, most cluster galaxies are of early
type, and thus their $\sigma_{v,*}$, and consequently their lensing effect, is expected to be
larger than for typical field galaxies.
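
The amplification of the galaxy signal by the cluster convergence follows directly from the definition of the reduced shear. The short Python sketch below compares the same hypothetical galaxy shear in the field ($\kappa = 0$) and inside a cluster. It is a simplified illustration: the function name and numbers are assumptions, only the sub-critical regime $\kappa < 1$ is covered, and the shear of the cluster itself is ignored.

```python
def reduced_shear(gamma, kappa):
    """Reduced shear g = gamma / (1 - kappa) in the sub-critical regime kappa < 1."""
    if kappa >= 1.0:
        raise ValueError("this sketch only covers the sub-critical regime kappa < 1")
    return gamma / (1.0 - kappa)

gamma_gal = 0.02                             # hypothetical shear from one galaxy halo
g_field = reduced_shear(gamma_gal, 0.0)      # field galaxy: g equals gamma
g_cluster = reduced_shear(gamma_gal, 0.4)    # same halo behind a cluster with kappa = 0.4
boost = g_cluster / g_field                  # amplification factor 1 / (1 - kappa)
```

The amplification factor depends on the local $\kappa$ only, which is why the cluster mass distribution has to be known before the galaxy haloes can be studied.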

In fact, the lensing effect of individual cluster galaxies can even be seen from 
strong lensing. Modelling clusters with many strong-lensing constraints (e.g., 
several arcs, multiple images of background galaxies), the incorporation of in- 
dividual cluster galaxies turns out to be necessary (e.g., Kassiola et al. 1992; 
Wallington et al. 1995; Kneib et al. 1996). However, the resulting constraints are 
relevant only for a few cluster galaxies which happen to be close to the strong-lensing
features, and mainly concern the mass of these galaxies within $\sim 10\,h^{-1}\,\mathrm{kpc}$.

The theory of galaxy-galaxy lensing in clusters was developed in 
Natarajan & Kneib (1997) and Geiger & Schneider (1998), using several dif- 
ferent approaches. The simplest possibility is related to the aperture mass method 
discussed in Sect. 5.3.1. Measuring the tangential shear within an annulus around 
each cluster galaxy, perhaps including a weight function, permits a measurement
of the aperture mass, and thus constrains the parameters of a mass model for the
galaxies. Provided the scale of the aperture is sufficiently small, the tidal field of
the cluster averages out to first order, and the local influence of the cluster occurs
through the local surface mass density $\kappa$. In particular, the scale of the aperture
should be small enough to exclude neighbouring cluster galaxies.

A more sophisticated analysis starts from a mass model of the cluster, as ob- 
tained by one of the reconstruction techniques discussed in Sect. 5, or by a pa- 
rameterised mass model constructed from strong-lensing constraints. Then, pa- 
rameterised galaxy models are added, again with a prescription similar to that of 
Sect. 8.2, and simultaneously the mass model of the cluster is multiplied by the 
relative mass fraction in the smoothly distributed cluster mass (compared to the to- 
tal mass). In other words, the mass added by inserting galaxies into the cluster is 
subtracted from the smooth density profile. From the observed galaxy ellipticities, 
a likelihood function can be defined and maximised with respect to the parameters 
$(\sigma_{v,*}, s_*)$ of the galaxy model. Natarajan et al. (1998)
applied this method to WFPC2 images of the cluster AC 114 ($z_\mathrm{d} = 0.31$). They
concluded that most of the mass of a fiducial $L_*$ cluster galaxy is contained in a
radius of $\sim 15$ kpc, indicating that the halo size of galaxies in this cluster is smaller
than that of field galaxies.

Once the mass contained in the cluster galaxies is a significant fraction of the to- 
tal mass of the cluster, this method was found to break down, or give strongly 
biased results. Geiger & Schneider (1999) modified this approach by performing 
a maximum-likelihood cluster mass reconstruction for each parameter set of the 
cluster galaxies, allowing the determination of the best representation of the global 
underlying cluster component that is consistent with the presence of the cluster 
galaxies and the observed image ellipticities of background galaxies. 

This method was then applied to the WFPC2 image of the cluster Cl0939+4713,
already described in Sect. 5.4. The entropy-regularised maximum-likelihood mass
reconstruction of the cluster is very similar to the one shown in Fig. 14 (page 104),
except that the cluster centre is much better resolved, with a peak very close to
the observed strong lensing features (Trager et al. 1997). Cluster galaxies were
selected according to their magnitudes, and divided by morphology into two
sub-samples, viz. early-type galaxies and spirals. In Fig. 31 we show the likelihood
contours in the $s_*$-$\sigma_{v,*}$ plane, for both subsets of cluster galaxies. Whereas there is
no statistically significant detection of lensing by spiral galaxies, the lensing effect
of early-type galaxies is clearly detected. Although no firm upper limit of the halo
size can be derived from this analysis owing to the small angular field of the image
(the maximum of the likelihood function occurs at $8\,h^{-1}\,\mathrm{kpc}$, and a 1-$\sigma$ upper
limit would be $\sim 50\,h^{-1}\,\mathrm{kpc}$), the contours 'close' at smaller values of $s_*$ compared
to the results obtained from field galaxies. By statistically combining several cluster
images, a significant upper limit on the halo size can be expected.

It should be noted that the results presented above still contain some uncertainties,
most notably the unknown redshift distribution of the background galaxies and the
mass-sheet degeneracy, which becomes particularly severe owing to the small
field-of-view of WFPC2. Changing the assumed redshift distribution and the scaling
parameter $\lambda$ in (5.9, page 91) shifts the likelihood contours in Fig. 31 up or down, i.e.,
the determination of $\sigma_{v,*}$ is affected. As for galaxy-galaxy lensing of field galaxies,
the accuracy can be increased by using photometric redshift estimates. Similarly,
the allowed range of the mass-sheet transformation can be constrained by combining
these small-scale images with larger scale ground-based images, or, if possible,
by using magnification information to break the degeneracy. Certainly, these
improvements of the method will be a field of active research in the immediate future.

Fig. 31. Results of applying the entropy-regularised maximum-likelihood method for
galaxy-galaxy lensing to the WFPC2 image of the cluster Cl0939+4713. The upper and
lower panels correspond to early-type and spiral galaxies, respectively. The solid lines are
confidence contours at 68.3%, 95.4% and 99.7%, and the cross marks the maximum of the
likelihood function. Dashed lines correspond to galaxy models with equal aperture mass
$M_*(<8\,h^{-1}\,\mathrm{kpc}) = (0.1, 0.5, 1.0) \times 10^{11}\,h^{-1}\,M_\odot$. Similarly, the dotted lines connect
models of constant total mass for an $L_*$-galaxy, of $M_* = (0.1, 0.5, 1.0, 5.0, 10) \times 10^{12}\,h^{-1}\,M_\odot$,
which corresponds to a mass fraction contained in galaxies of (0.15, 0.75, 1.5, 7.5, 15)%,
respectively (from Geiger & Schneider 1999).

9 The Impact of Weak Gravitational Light Deflection on the Microwave Background Radiation

9.1 Introduction 



The Cosmic Microwave Background originated in the hot phase after the Big 
Bang, when photons were created in thermal equilibrium with electromagneti- 
cally interacting particles. While the Universe expanded and cooled, the photons 
remained in thermal equilibrium until the temperature was sufficiently low for 
electrons to combine with the newly formed nuclei of mainly hydrogen and he- 
lium. While the formation of atoms proceeded, the photons decoupled from the 
matter due to the rapidly decreasing abundance of charged matter. Approximately 
300,000 years after the Big Bang, corresponding to a redshift of $z \sim 1{,}000$, the
Universe became transparent for the radiation, which retained the Planck spectrum it
had acquired while it was in thermal equilibrium, and the temperature decreased
in inverse proportion to the scale factor as the Universe expanded. This relic
radiation, cooled to $T = 2.73$ K, forms the Cosmic Microwave Background (hereafter
CMB). Penzias & Wilson (1965) detected it as an "excess antenna temperature", 
and Fixsen et al. (1996) used the COBE-FIRAS instrument to prove its perfect 
black-body spectrum. 

Had the Universe been ideally homogeneous and isotropic, the CMB would have 
the intensity of black-body radiation at 2.73 K in all directions on the sky, and would 
thus be featureless. Density perturbations in the early Universe, however, imprinted 
their signature on the CMB through various mechanisms, which are thoroughly 
summarised and discussed in Hu (1995). Photons in potential wells at the time of 
decoupling had to climb out, thus losing energy and becoming slightly cooler than 
the average CMB. This effect, now called the Sachs-Wolfe effect, was originally
studied by Sachs & Wolfe (1967), who found that the temperature anisotropies in
the CMB trace the potential fluctuations on the 'surface' of decoupling. CMB
fluctuations were first detected by the COBE-DMR experiment (Smoot et al. 1992) and
subsequently confirmed by numerous ground-based and balloon-borne experiments 
(see Smoot 1997 for a review). 

The interplay between gravity and radiation pressure in perturbations of the cos- 
mic 'fluid' before recombination gave rise to another important effect. Radiation 
pressure is only effective in perturbations smaller than the horizon. Once a perturbation
enters the horizon, radiation pressure provides a restoring force against gravity, leading
to acoustic oscillations in the tightly coupled fluid of photons and charged particles,
which cease only when radiation pressure drops while radiation decouples.
Therefore, for each physical perturbation scale, the acoustic oscillations set in at
the same time, i.e. when the horizon size becomes equal to the perturbation size, and
they end at the same time, i.e. when radiation decouples. At fixed physical scale,
these oscillations are therefore coherent, and they show up as distinct peaks (the
so-called Doppler peaks) and troughs in the power spectrum of the CMB fluctuations.
Perturbations large enough to enter the horizon after decoupling never experience 
these oscillations. Going through the CMB power spectrum from large to small 
scales, there should therefore be a 'first' Doppler peak at a location determined by 
the horizon scale at the time of decoupling. 

A third important effect sets in on the smallest scales. If a density perturbation is 
small enough, radiation pressure can blow it apart because its self-gravity is too 
weak. This effect is comparable to the Jeans' criterion for the minimal mass re- 
quired for a pressurised perturbation to collapse. It amounts to a suppression of 
small-scale fluctuations and is called Silk damping, leading to an exponential de- 
cline at the small-scale end of the CMB fluctuation power spectrum. 

Other effects arise between the 'surface' of decoupling and the observer. 
Rees & Sciama (1968) pointed out that large non-linear density perturbations between
the last-scattering surface and us can lead to a distinct effect if those fluctuations
change while the photons traverse them. Falling into the potential wells, the photons
experience a blue-shift which is stronger than the red-shift they suffer while climbing
out, because expansion makes the wells shallower in the meantime, thus giving rise
to a net blue-shift of photons. Later, this effect was re-examined in the framework
of the 'Swiss-Cheese' (Dyer 1976) and 'vacuole' (Nottale 1984) models of density
perturbations in an expanding background space-time. The masses of such
perturbations have to be very large for this effect to become larger than the
Sunyaev-Zel'dovich effect* due to the hot gas contained in them; Dyer (1976) estimated
that masses beyond $10^{19}\,M_\odot$ would be necessary, a value four to five orders of
magnitude larger than that of typical galaxy clusters.

The gravitational lens effect of galaxy clusters moving transverse to the line-of-sight
was investigated by Birkinshaw & Gull (1983) who found that a cluster with
$\sim 10^{15}\,M_\odot$ and a transverse velocity of $\sim 6000\,\mathrm{km\,s^{-1}}$ should change the CMB
temperature by $\sim 10^{-4}$ K. Later, Gurvits & Mitrofanov (1986) re-investigated this
effect and found it to be about an order of magnitude smaller.

Cosmic strings as another class of rapidly moving gravitational lenses were studied
by Kaiser & Stebbins (1984), who showed that they would give rise to step-like
features in the CMB temperature pattern.



* The (thermal) Sunyaev-Zel'dovich effect is due to Compton-upscattering of CMB photons
by thermal electrons in the hot plasma in galaxy clusters. Since the temperature of
the electrons is much higher than that of the photons, CMB photons are effectively
redistributed towards higher energies. At frequencies lower than about 218 GHz, the CMB
intensity is thus decreased towards galaxy clusters; in effect, they cast shadows on the
surface of the CMB.

9.2 Weak Lensing of the CMB



The introduction shows that the CMB is expected to display distinct features in 
a hierarchical model of structure formation. The CMB power spectrum should be 
featureless on large scales, then exhibit pronounced Doppler peaks at scales smaller 
than the horizon at the time of decoupling, and an exponential decrease due to 
Silk damping at the small-scale end. We now turn to investigate whether and how 
gravitational lensing by large-scale structures can alter these features. 

The literature on the subject is rich (see Blanchard & Schneider 1987,
Cayon et al. 1993b, Cayon et al. 1993a, Cole & Efstathiou 1989,
Fukugita et al. 1992, Kashlinsky 1988, Linder 1988, Linder 1990a,
Linder 1990b, Martinez-Gonzalez et al. 1990, Sasaki 1989, Tomita 1989,
Watanabe & Tomita 1991), but different authors have sometimes arrived at
contradictory conclusions. Perhaps the most elegant way of studying weak lensing
of the CMB is the power-spectrum approach, which was most recently advocated
by Seljak (1994, 1996).

We should like to start our discussion by clearly stating two facts concerning the ef- 
fect of lensing on fluctuations in the Cosmic Microwave Background which clarify 
and resolve several apparently contradictory discussions and results in the litera- 
ture. 

(1) If the CMB were completely isotropic, gravitational lensing would have no
effect whatsoever because it conserves surface brightness. In this case, lensing
would only magnify certain patches in the sky and de-magnify others, but
since it would not alter the surface brightness in the magnified or de-magnified
patches, the temperature would remain unaffected. An analogy would be observers
facing an infinitely extended homogeneously coloured wall, seeing some parts
of it enlarged and others shrunk. Regardless of the magnification, they would
see the same colour everywhere, and so they would notice nothing despite the
magnification.

(2) It is not the absolute value of the light deflection due to lensing which matters,
but the relative deflection of neighbouring light rays. Imagine a model universe
in which all light rays are isotropically deflected by the same arbitrary
amount. The pattern of CMB anisotropies seen by an observer would then be
coherently shifted relative to the intrinsic pattern, but remain unchanged otherwise.
It is thus merely the dispersion of deflection angles that is relevant
for the impact of lensing on the observed CMB fluctuation pattern.
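
Both statements can be illustrated with a one-dimensional toy model in which the observed temperature is the intrinsic one evaluated at the deflected position. The Python sketch below is purely illustrative and rests on several assumptions that are not part of the analysis in this section: a periodic pixel grid, integer pixel deflections, an intrinsic pattern made of two sinusoidal modes, and hypothetical helper names (`lensed`, `autocorrelation`).

```python
import math

N = 256

def lensed(field, deflection):
    """Observed field T'(i) = T(i - alpha_i) on a periodic one-dimensional grid.

    deflection holds integer pixel offsets alpha_i (hypothetical toy units).
    """
    n = len(field)
    return [field[(i - deflection[i]) % n] for i in range(n)]

def autocorrelation(field, lag):
    """Periodic two-point correlation <tau(i) tau(i+lag)> of the relative fluctuation."""
    n = len(field)
    mean = sum(field) / n
    tau = [(t - mean) / mean for t in field]
    return sum(tau[i] * tau[(i + lag) % n] for i in range(n)) / n

# Intrinsic pattern: 2.73 K plus two small sinusoidal modes.
intrinsic = [2.73 * (1.0 + 1e-5 * math.sin(2 * math.pi * 3 * i / N)
                         + 2e-5 * math.cos(2 * math.pi * 7 * i / N)) for i in range(N)]
uniform = [2.73] * N

varying = [round(4 * math.sin(2 * math.pi * i / N)) for i in range(N)]  # differential deflection
constant = [5] * N                                                      # same shift everywhere

uniform_lensed = lensed(uniform, varying)    # statement (1): nothing changes
shifted = lensed(intrinsic, constant)        # statement (2): coherent shift only
distorted = lensed(intrinsic, varying)       # relative deflections do matter

xi_intrinsic = [autocorrelation(intrinsic, lag) for lag in range(32)]
xi_shifted = [autocorrelation(shifted, lag) for lag in range(32)]
xi_distorted = [autocorrelation(distorted, lag) for lag in range(32)]
```

The uniform map is returned unchanged, the constant deflection leaves the correlation function untouched, and only the position-dependent deflection alters it.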

9.3 CMB Temperature Fluctuations



In the absence of any lensing effects, we observe at the sky position $\vec\theta$ the intrinsic
CMB temperature $T(\vec\theta)$. There are fluctuations $\Delta T(\vec\theta)$ in the CMB temperature
about its average value $\langle T\rangle = 2.73$ K. We abbreviate the relative temperature
fluctuations by

$$\frac{\Delta T(\vec\theta)}{\langle T\rangle} \equiv \tau(\vec\theta) \tag{9.1}$$

in the following. They can statistically be described by their angular auto-correlation
function

$$\xi_T(\phi) = \left\langle \tau(\vec\theta)\,\tau(\vec\theta + \vec\phi) \right\rangle , \tag{9.2}$$

with the average extending over all positions $\vec\theta$. Due to statistical isotropy, $\xi_T(\phi)$
depends neither on the position $\vec\theta$ nor on the direction of $\vec\phi$, but only on the absolute
separation $\phi$ of the correlated points.

Commonly, CMB temperature fluctuations are also described in terms of the
coefficients $a_{lm}$ of an expansion into spherical harmonics,

$$\tau(\theta, \phi) = \sum_{l=0}^{\infty} \sum_{m=-l}^{l} a_{lm}\,Y_l^m(\theta, \phi) , \tag{9.3}$$

and the averaged expansion coefficients constitute the angular power spectrum $C_l$
of the CMB fluctuations,

$$C_l = \left\langle |a_{lm}|^2 \right\rangle . \tag{9.4}$$

It can then be shown that the correlation function $\xi_T(\phi)$ is related to the
power-spectrum coefficients $C_l$ through

$$C_l = 2\pi \int_0^\pi d\phi\,\sin(\phi)\,P_l(\cos\phi)\,\xi_T(\phi) , \tag{9.5}$$

with the Legendre functions $P_l(\cos\phi)$.
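
The relation between the correlation function and the power-spectrum coefficients can be verified numerically. With the normalisation of (9.3) and (9.4), the addition theorem for spherical harmonics gives $\xi_T(\phi) = \sum_l (2l+1)/(4\pi)\,C_l\,P_l(\cos\phi)$, so that the inversion by Legendre orthogonality carries a prefactor $2\pi$. The Python sketch below builds $\xi_T$ from a toy spectrum and recovers the coefficients by direct quadrature. The spectrum, the function names and the midpoint rule are assumptions chosen for this illustration, not part of the formalism above.

```python
import math

def legendre(l_max, x):
    """Return [P_0(x), ..., P_l_max(x)] from the three-term recurrence."""
    p = [1.0, x]
    for l in range(1, l_max):
        p.append(((2 * l + 1) * x * p[l] - l * p[l - 1]) / (l + 1))
    return p[: l_max + 1]

def xi_from_cl(cl, phi):
    """Correlation function xi_T(phi) = sum_l (2l+1)/(4 pi) C_l P_l(cos phi)."""
    p = legendre(len(cl) - 1, math.cos(phi))
    return sum((2 * l + 1) / (4.0 * math.pi) * c * p[l] for l, c in enumerate(cl))

def cl_from_xi(xi, l, n=4000):
    """C_l = 2 pi * integral_0^pi dphi sin(phi) P_l(cos phi) xi(phi), midpoint rule."""
    dphi = math.pi / n
    total = 0.0
    for j in range(n):
        phi = (j + 0.5) * dphi
        total += math.sin(phi) * legendre(l, math.cos(phi))[l] * xi(phi)
    return 2.0 * math.pi * total * dphi

# Toy spectrum (hypothetical): C_l = 1/(l(l+1)) for 2 <= l <= 10, zero otherwise.
cl_in = [0.0, 0.0] + [1.0 / (l * (l + 1)) for l in range(2, 11)]
cl_out = [cl_from_xi(lambda phi: xi_from_cl(cl_in, phi), l) for l in range(11)]
```

The recovered coefficients `cl_out` agree with the input `cl_in` to the accuracy of the quadrature, which confirms the normalisation of the transform pair.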

9.4 Auto-Correlation Function of the Gravitationally Lensed CMB 

9.4.1 Definitions

If there are any density inhomogeneities along the line-of-sight towards the
last-scattering surface at $z \sim 1{,}000$ (the 'source plane' of the CMB), a light ray starting
into direction $\vec\theta$ at the observer will intercept the last-scattering surface at the
deflected position

$$\vec\beta = \vec\theta - \vec\alpha(\vec\theta) , \tag{9.6}$$

where $\vec\alpha(\vec\theta)$ is the (position-dependent) deflection angle experienced by the light
ray. We will therefore observe, at position $\vec\theta$, the temperature of the CMB at position
$\vec\beta$, or

$$T(\vec\beta) \equiv T'(\vec\theta) = T[\vec\theta - \vec\alpha(\vec\theta)] . \tag{9.7}$$

The intrinsic temperature autocorrelation function is thus changed by lensing to

$$\xi'_T(\phi) = \left\langle \tau[\vec\theta - \vec\alpha(\vec\theta)]\;\tau[(\vec\theta + \vec\phi) - \vec\alpha(\vec\theta + \vec\phi)] \right\rangle . \tag{9.8}$$

For simplicity of notation, we further abbreviate $\vec\alpha(\vec\theta) = \vec\alpha$ and $\vec\alpha(\vec\theta + \vec\phi) = \vec\alpha'$ in the
following.

9.4.2 Evaluation 

In this section we evaluate the modified correlation function (9.8) and quantify 
the lensing effects. For this purpose, it is convenient to decompose the relative 
temperature fluctuation $\tau(\vec\theta)$ into Fourier modes,

$$\tau(\vec\theta) = \int \frac{d^2l}{(2\pi)^2}\,\hat\tau(\vec l)\,\exp(i\,\vec l\cdot\vec\theta) . \tag{9.9}$$

The expansion of $\tau(\vec\theta)$ into Fourier modes rather than into spherical harmonics is
permissible because we do not expect any weak-lensing effects on large angular
scales, so that we can consider $\tau(\vec\theta)$ on a plane locally tangential to the sky rather
than on a sphere.

We insert the Fourier decomposition (9.9) into the expression for the correlation
function (9.8) and perform the average. We need to average over ensembles and
over the random angle between the wave vector $\vec l$ of the temperature modes and the
angular separation $\vec\phi$ of the correlated points. The ensemble average corresponds
to averaging over realisations of the CMB temperature fluctuations in a sample of
universes or, since we focus on small scales, over a large number of disconnected
regions on the sky. This average introduces the CMB fluctuation spectrum $P_T(l)$,
which is defined by

$$\left\langle \hat\tau(\vec l)\,\hat\tau^*(\vec l') \right\rangle \equiv (2\pi)^2\,\delta^{(2)}(\vec l - \vec l')\,P_T(l) . \tag{9.10}$$

Averaging over the angle between $\vec l$ and the separation vector $\vec\phi$ gives rise to the
zeroth-order Bessel function of the first kind, $J_0(x)$. These manipulations leave eq. (9.8) in
the form

$$\xi'_T(\phi) = \int_0^\infty \frac{l\,dl}{2\pi}\,P_T(l)\,\left\langle \exp\left[i\,\vec l\cdot(\vec\alpha - \vec\alpha')\right] \right\rangle J_0(l\phi) . \tag{9.11}$$



The average over the exponential in eq. (9.11) remains to be performed. To do so,
we first expand the exponential into a power series,

$$\left\langle \exp(i\,\vec l\cdot\delta\vec\alpha) \right\rangle = \sum_{j=0}^{\infty} \frac{\left\langle (i\,\vec l\cdot\delta\vec\alpha)^j \right\rangle}{j!} , \tag{9.12}$$

where $\delta\vec\alpha \equiv \vec\alpha - \vec\alpha'$ is the deflection-angle difference between neighbouring light
rays with initial angular separation $\vec\phi$. We now assume that the deflection angles are
Gaussian random fields. This is reasonable because (i) deflection angles are due to
Gaussian random fluctuations in the density-contrast field as long as the fluctuations
evolve linearly, and (ii) the assumption of linear evolution holds well for redshifts
where most of the deflection towards the last-scattering surface occurs. Of course,
this makes use of the commonly held view that the initial density fluctuations are
of Gaussian nature. Under this condition, the odd moments in eq. (9.12) all vanish.
It can then be shown that

$$\left\langle \exp(i\,\vec l\cdot\delta\vec\alpha) \right\rangle = \exp\left[-\frac{1}{2}\,l^2\sigma^2(\phi)\right] \tag{9.13}$$

holds exactly, where $\sigma^2(\phi)$ is the deflection-angle dispersion,

$$\sigma^2(\phi) \equiv \left\langle (\vec\alpha - \vec\alpha')^2 \right\rangle . \tag{9.14}$$
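
The Gaussian average can be checked by a simple Monte-Carlo experiment. The Python sketch below draws the deflection-angle difference as a zero-mean Gaussian and compares the sample mean of the phase factor with $\exp(-l^2\sigma^2/2)$. It is a one-dimensional illustration under an explicit assumption: $\delta\alpha$ is treated as a scalar, standing for the component of the deflection-angle difference along the wave vector, and $\sigma$ denotes the dispersion of that component. The numerical values and the function name are hypothetical.

```python
import cmath
import math
import random

def mean_phase_factor(l, sigma, n_samples=200_000, seed=1):
    """Monte-Carlo estimate of <exp(i l delta_alpha)> for Gaussian delta_alpha.

    delta_alpha is drawn as a zero-mean scalar Gaussian with dispersion sigma,
    standing for the component of the deflection-angle difference along l.
    """
    rng = random.Random(seed)
    total = 0j
    for _ in range(n_samples):
        total += cmath.exp(1j * l * rng.gauss(0.0, sigma))
    return total / n_samples

l = 1000.0        # wave number (hypothetical value)
sigma = 1.0e-3    # dispersion of delta_alpha in radians (hypothetical value)
estimate = mean_phase_factor(l, sigma)
exact = math.exp(-0.5 * l**2 * sigma**2)
```

The imaginary part of the estimate scatters around zero because the odd moments vanish, while the real part converges to the Gaussian damping factor as the number of samples grows.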

Even if the assumption that $\delta\vec\alpha$ is a Gaussian random field fails, eq. (9.13) still holds
approximately. To see this, we note that the CMB power spectrum falls sharply on
scales $l \gtrsim l_\mathrm{c} \sim (10'\,\Omega_0^{1/2})^{-1}$. The scale $l_\mathrm{c}$ is set by the width of the last-scattering
surface at redshift $z \sim 1{,}000$. Smaller-scale fluctuations are efficiently damped by
acoustic oscillations of the coupled photon-baryon fluid. Typical angular scales
in the CMB fluctuations are therefore considerably larger than the difference between
gravitational deflection angles of neighbouring rays, $\delta\vec\alpha$, so that $\vec l\cdot(\vec\alpha - \vec\alpha')$ is
a small number. Hence, ignoring fourth-order terms in $\vec l\cdot\delta\vec\alpha$, the remaining exponential
in (9.11) can be approximated by

$$\left\langle \exp(i\,\vec l\cdot\delta\vec\alpha) \right\rangle \approx 1 - \frac{1}{2}\,l^2\sigma^2(\phi) \approx \exp\left[-\frac{1}{2}\,l^2\sigma^2(\phi)\right] . \tag{9.15}$$

Therefore, the temperature auto-correlation function modified by gravitational lensing
can safely be written

$$\xi'_T(\phi) = \int_0^\infty \frac{l\,dl}{2\pi}\,P_T(l)\,\exp\left[-\frac{1}{2}\,l^2\sigma^2(\phi)\right] J_0(l\phi) . \tag{9.16}$$

This equation shows that the intrinsic temperature-fluctuation power spectrum is
convolved with a Gaussian function in wave number $l$ with dispersion $\sigma^{-1}(\phi)$. The
effect of lensing on the CMB temperature fluctuations is thus to smooth fluctuations
on angular scales of order or smaller than $\sigma(\phi)$.
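
The smoothing can be made explicit numerically by evaluating the lensed correlation function $\xi'_T(\phi) = \int_0^\infty l\,dl/(2\pi)\,P_T(l)\exp[-l^2\sigma^2(\phi)/2]\,J_0(l\phi)$ for a toy spectrum. The Python sketch below uses a Gaussian spectrum $P_T(l) = \exp(-l^2 s^2/2)$, which is not a CMB model but has the advantage that the integral is known in closed form: the intrinsic width $s$ and the dispersion $\sigma$ add in quadrature. All names and numbers are assumptions made for this illustration, and the Bessel function is computed from its integral representation to keep the example free of external libraries.

```python
import math

def bessel_j0(x, n=64):
    """J_0(x) from J_0(x) = (1/pi) * integral_0^pi cos(x sin t) dt (midpoint rule)."""
    h = math.pi / n
    return sum(math.cos(x * math.sin((k + 0.5) * h)) for k in range(n)) / n

def lensed_xi(phi, power, sigma, l_max, n_l=2000):
    """Integral l dl/(2 pi) P_T(l) exp(-l^2 sigma^2/2) J_0(l phi), midpoint rule in l."""
    dl = l_max / n_l
    total = 0.0
    for j in range(n_l):
        l = (j + 0.5) * dl
        total += l * power(l) * math.exp(-0.5 * (l * sigma) ** 2) * bessel_j0(l * phi)
    return total * dl / (2.0 * math.pi)

s = 3.0e-3        # intrinsic coherence angle in radians (hypothetical)

def power(l):
    """Toy Gaussian spectrum, chosen for its closed-form transform."""
    return math.exp(-0.5 * (l * s) ** 2)

phi = 2.0e-3      # separation in radians
sigma = 1.0e-3    # deflection-angle dispersion at this separation (hypothetical)
xi_unlensed = lensed_xi(phi, power, 0.0, l_max=3000.0)
xi_lensed = lensed_xi(phi, power, sigma, l_max=3000.0)

# Closed form for the Gaussian toy spectrum: the widths add in quadrature.
s2 = s**2 + sigma**2
xi_exact = math.exp(-0.5 * phi**2 / s2) / (2.0 * math.pi * s2)
```

For this toy spectrum the lensed correlation at small separation is lower than the unlensed one, as expected when structure on scales below $\sigma$ is smoothed out.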



9.4.3 Alternative Representations 

Equation (9.16) relates the unlensed CMB power spectrum to the lensed temperature
auto-correlation function. Noting that $P_T(l)$ is the Fourier transform of $\xi_T(\phi)$,

$$P_T(l) = \int d^2\phi\;\xi_T(\phi)\,\exp(-i\,\vec l\cdot\vec\phi) = 2\pi \int \phi\,d\phi\;\xi_T(\phi)\,J_0(l\phi) , \tag{9.17}$$

we can substitute one for the other. Isotropy permitted us to perform the integration
over the (random) angle between $\vec l$ and $\vec\phi$ in the last step of (9.17). Inserting (9.17)
into (9.16) leads to

$$\xi'_T(\phi) = \int \phi'\,d\phi'\;\xi_T(\phi')\,K(\phi, \phi') . \tag{9.18}$$

The kernel $K(\phi, \phi')$ is given by



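$$K(\phi, \phi') \equiv \int_0^\infty l\,dl\;J_0(l\phi)\,J_0(l\phi')\,\exp\left[-\frac{1}{2}\,l^2\sigma^2(\phi)\right]
= \frac{1}{\sigma^2(\phi)}\,\exp\left[-\frac{\phi^2 + \phi'^2}{2\sigma^2(\phi)}\right] I_0\left[\frac{\phi\phi'}{\sigma^2(\phi)}\right] , \tag{9.19}$$

where $I_0(x)$ is the modified zeroth-order Bessel function. Equation (6.663.2) of
Gradshteyn & Ryzhik (1994) was used in the last step. As will be shown below,
$\sigma(\phi) \ll 1$, so that the argument of $I_0$ is generally a very large number. Noting that
$I_0(x) \approx (2\pi x)^{-1/2}\exp(x)$ for $x \to \infty$, we can write eq. (9.16) in the form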



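$$\xi'_T(\phi) \approx \frac{1}{(2\pi\phi)^{1/2}\,\sigma(\phi)} \int d\phi'\;\phi'^{1/2}\,\xi_T(\phi')\,\exp\left[-\frac{(\phi - \phi')^2}{2\sigma^2(\phi)}\right] . \tag{9.20}$$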



Like eq. (9.16), this expression shows that lensing smoothes the intrinsic temperature
auto-correlation function $\xi_T(\phi)$ on angular scales of $\phi \approx \sigma(\phi)$ and smaller.
Note in particular that, if $\sigma(\phi) \to 0$, the exponential in (9.20) tends towards a Dirac
delta distribution,

$$\lim_{\sigma(\phi)\to 0} \frac{1}{\sqrt{2\pi}\,\sigma(\phi)}\,\exp\left[-\frac{(\phi - \phi')^2}{2\sigma^2(\phi)}\right] = \delta(\phi - \phi') , \tag{9.21}$$

so that the lensed and unlensed temperature auto-correlation functions agree.



Likewise, one can Fourier back-transform eq. (9.16) to obtain a relation between
the lensed and the un-lensed CMB power spectra. To evaluate the resulting integral,
it is convenient to assume $\sigma(\phi) = \epsilon\phi$, with $\epsilon$ being either a constant or a slowly
varying function of $\phi$. This assumption will be justified below. One then finds

$$P'_T(l') = \int_0^\infty \frac{dl}{\epsilon^2 l}\,P_T(l)\,\exp\left[-\frac{l^2 + l'^2}{2\epsilon^2 l^2}\right] I_0\left[\frac{l'}{\epsilon^2 l}\right] . \tag{9.22}$$

For $\epsilon \ll 1$, this expression can be simplified to

$$P'_T(l') \approx \int_0^\infty \frac{dl}{\sqrt{2\pi}\,\epsilon l}\,P_T(l)\,\exp\left[-\frac{(l - l')^2}{2\epsilon^2 l^2}\right] . \tag{9.23}$$

9.5 Deflection-Angle Variance

9.5.1 Auto-Correlation Function of Deflection Angles

We proceed by evaluating the dispersion $\sigma^2(\phi)$ of the deflection angles. This is
conveniently derived from the deflection-angle auto-correlation function,

$$\xi_\alpha(\phi) \equiv \left\langle \vec\alpha\cdot\vec\alpha' \right\rangle . \tag{9.24}$$

Note that the correlation function of $\vec\alpha$ is the sum of the correlation functions of the
components of $\vec\alpha$,

$$\xi_\alpha = \left\langle \vec\alpha\cdot\vec\alpha' \right\rangle = \left\langle \alpha_1\alpha'_1 \right\rangle + \left\langle \alpha_2\alpha'_2 \right\rangle = \xi_{\alpha_1} + \xi_{\alpha_2} . \tag{9.25}$$

In terms of the autocorrelation function, the dispersion $\sigma^2(\phi)$ can be written

$$\sigma^2(\phi) = \left\langle [\vec\alpha - \vec\alpha']^2 \right\rangle = 2\left[\xi_\alpha(0) - \xi_\alpha(\phi)\right] . \tag{9.26}$$

The deflection angle is given by eq. (6.11) on page 121 in terms of the Newtonian
potential $\Phi$ of the density fluctuations $\delta$ along the line-of-sight. For lensing of the
CMB, the line-of-sight integration extends along the (unperturbed) light ray from
the observer at $w = 0$ to the last-scattering surface at $w(z \approx 1000)$; see the derivation
in Sect. 6.2 leading to eq. (6.11, page 121).

We introduced the effective convergence in (6.14, page 122) as half the divergence
of the deflection angle. In Fourier space, this equation can be inverted to yield the
Fourier transform of the deflection angle,

$$\hat{\vec\alpha}(\vec l) = -\frac{2i\,\hat\kappa_\mathrm{eff}(\vec l)}{|\vec l|^2}\,\vec l . \tag{9.27}$$



The deflection-angle power spectrum can therefore be written as 



$$P_\alpha(l) = \frac{4}{l^2}\,P_\kappa(l) . \tag{9.28}$$



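The deflection-angle autocorrelation function is obtained from eq. (9.28) via
Fourier transformation. The result is

$$\xi_\alpha(\phi) = \int \frac{d^2l}{(2\pi)^2}\,P_\alpha(l)\,\exp(-i\,\vec l\cdot\vec\phi) = 2\pi \int_0^\infty l\,dl\;P_\kappa(l)\,\frac{J_0(l\phi)}{(\pi l)^2} , \tag{9.29}$$

similar to the form (6.58, page 144), but here the filter function is no longer a
function of the product $l\phi$ only, but of $l$ and $\phi$ separately,

$$F(l, \phi) = \frac{J_0(l\phi)}{(\pi l)^2} . \tag{9.30}$$

We plot $\phi^{-2}F(l, \phi)$ in Fig. 32. For fixed $\phi$, the filter function suppresses small-scale
fluctuations, and it tends towards $F(l, \phi) \to (\pi l)^{-2}$ for $l \to 0$.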

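Inserting $P_\kappa(l)$ into (9.29), we find the explicit expression for the deflection-angle
auto-correlation function,

$$\xi_\alpha(\phi) = \frac{9 H_0^4 \Omega_0^2}{c^4} \int_0^w dw'\;W^2(w', w)\,a^{-2}(w')
\int_0^\infty \frac{dk}{2\pi k}\,P_\delta(k, w')\,J_0[f_K(w')\,k\,\phi] . \tag{9.31}$$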

Despite the obvious similarity between this result and the magnification
auto-correlation function (6.34 on page 130), it is worth noting two important
differences. First, the weighting of the integrand along the line-of-sight differs by a
factor of $f_K^2(w')$ because we integrate deflection-angle components rather than the
convergence, i.e. first rather than second-order derivatives of the potential $\Phi$.
Consequently, structures near the observer are weighted more strongly than for
magnification or shear effects. Secondly, the wave-number integral is weighted by $k^{-1}$
rather than $k$, giving most weight to the largest-scale structures. Since their evolution
remains linear up to the present, it is expected that non-linear density evolution
is much less important for lensing of the CMB than it is for cosmic magnification
or shear.

Fig. 32. The filter function $F(l, \phi)$ as defined in eq. (9.30), divided by $\phi^2$, is shown as a
function of $l\phi$. Compare Fig. 22 on page 141. For fixed $\phi$, the filter function emphasises
large-scale projected density perturbations (i.e. structures with small $l$).

9.5.2 Typical Angular Scale

A typical angular scale $\phi_\mathrm{g}$ for the coherence of gravitational light deflection can be
obtained as

$$\phi_\mathrm{g} \equiv \left[\frac{1}{\xi_\alpha(0)} \left|\frac{\partial^2\xi_\alpha(\phi)}{\partial\phi^2}\right|_{\phi=0}\right]^{-1/2} . \tag{9.32}$$

As eq. (9.31) shows, the deflection-angle auto-correlation function depends on $\phi$
only through the argument of the Bessel function $J_0(x)$. For small arguments $x$, the
second-order derivative of $J_0(x)$ is approximately $J_0''(x) \approx -J_0(x)/2$. Differentiating
$\xi_\alpha(\phi)$ twice with respect to $\phi$, and comparing the result to the expression for
the magnification auto-correlation function $\xi_\mu(\phi)$ in eq. (6.34, page 130), we find

$$\frac{\partial^2\xi_\alpha(\phi)}{\partial\phi^2} \approx -\frac{1}{2}\,\xi_\mu(\phi) , \tag{9.33}$$

and thus

$$\phi_\mathrm{g} \approx \left[\frac{\xi_\mu(0)}{2\,\xi_\alpha(0)}\right]^{-1/2} . \tag{9.34}$$

We shall estimate $\phi_\mathrm{g}$ later after giving a simple expression for $\xi_\alpha(\phi)$. The angle $\phi_\mathrm{g}$
gives an estimate of the scale over which gravitational light deflection is coherent.



9.5.3 Special Cases and Qualitative Expectations 

We mentioned before that it is less critical here to assume linear density evolution
because large-scale density perturbations dominate in the expression for
$\xi_\alpha(\phi)$. Specialising further to an Einstein-de Sitter universe so that
$w \approx 2c/H_0$, eq. (9.31) simplifies to

\[
  \xi_\alpha(\phi) = \frac{9 H_0^4}{c^4}\, w \int_0^1 dy\,(1-y)^2
  \int_0^\infty \frac{dk}{2\pi k}\, P_\delta^{(0)}(k)\, J_0(w y k \phi) ,
  \tag{9.35}
\]

with $wy \equiv w'$.

Adopting the model spectra for HDM and CDM specified in eq. (6.36, page 131)
and expanding $\xi_\alpha(\phi)$ in a power series in $\phi$, we find, to second order in
$\phi$,

[Eq. (9.36): $\xi_\alpha(\phi)$ to second order in $\phi$, with one case for HDM and one
for CDM.]

Combining these expressions with eqs. (9.34) and (6.37, page 131), we find for the
deflection-angle coherence scale $\phi_g$

\[
  \phi_g \approx 3\,(w k_0)^{-1} .
  \tag{9.37}
\]

It is intuitively clear that $\phi_g$ should be determined by $(w k_0)^{-1}$. Since
$k_0^{-1}$ is the typical length scale of light-deflecting density perturbations, it
subtends an angle $(w k_0)^{-1}$ at distance $w$. Thus the coherence angle of light
deflection is given by the angle under which the deflecting density perturbation
typically appears. The source distance $w$ in the case of the CMB is the comoving
distance to $z = 1{,}000$. In the Einstein-de Sitter case, $w = 2$ in units of the Hubble
length. Hence, with $k_0^{-1} \approx 12\,(\Omega_0 h^2)^{-1}$ Mpc [cf. eq. (2.49),
page 25], we have $w k_0 \approx 500$. Therefore, the angular scale of the
deflection-angle auto-correlation is of order

\[
  \phi_g \approx 6 \times 10^{-3} \approx 20' .
  \tag{9.38}
\]
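
The numbers entering eq. (9.38) can be reproduced with a few lines of arithmetic. The
sketch below assumes an Einstein-de Sitter universe ($\Omega_0 = 1$), takes $h = 1$ so
that $w k_0 \approx 500$ as quoted in the text, and uses the Hubble length
$c/H_0 \approx 2998\,h^{-1}$ Mpc. The function and variable names are illustrative.

```python
import math

HUBBLE_LENGTH_MPC = 2997.92458          # c/H_0 in units of h^-1 Mpc
ARCMIN_PER_RADIAN = 180.0 * 60.0 / math.pi


def coherence_scale(h=1.0, omega0=1.0):
    """Return (w*k0, phi_g in radians, phi_g in arc minutes)."""
    w = 2.0 * HUBBLE_LENGTH_MPC / h          # w = 2 Hubble lengths, in Mpc
    k0_inverse = 12.0 / (omega0 * h ** 2)    # k0^-1 in Mpc, cf. eq. (2.49)
    wk0 = w / k0_inverse
    phi_g = 3.0 / wk0                        # eq. (9.37)
    return wk0, phi_g, phi_g * ARCMIN_PER_RADIAN


if __name__ == "__main__":
    wk0, phi_rad, phi_arcmin = coherence_scale()
    print(f"w k0 = {wk0:.0f}, phi_g = {phi_rad:.1e} rad = {phi_arcmin:.1f} arcmin")
```

With these inputs, $w k_0$ scales linearly with $h$, so the coherence angle shifts by a
corresponding factor for other values of the Hubble constant.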

To lowest order in $\phi$, the deflection-angle dispersion (9.26) reads

\[
  \sigma^2(\phi) \propto (w k_0)^3\,\phi^2 .
  \tag{9.39}
\]

The dispersion $\sigma(\phi)$ is plotted in Fig. 33 for the four cosmological models
specified in Tab. 1 on page 117 for linear and non-linear evolution of the density
fluctuations.

The behaviour of $\sigma(\phi)$ expressed in eq. (9.39) can qualitatively be understood
by describing the change in the transverse separation between light paths as a random
walk. Consider two light paths separated by an angle $\phi$ such that their comoving
transverse separation at distance $w$ is $w\phi$. Let $k^{-1}$ be the typical scale of a
potential fluctuation $\Phi$. We can then distinguish two different cases depending on
whether $w\phi$ is larger or smaller than $k^{-1}$. If $w\phi > k^{-1}$, the transverse
separation between the light paths is much larger than the typical potential
fluctuations, and their deflection will be incoherent. It will be coherent in the
opposite case, i.e. if $w\phi < k^{-1}$.

When the light paths are coherently scattered passing a potential fluctuation, their
angular separation changes by
$\delta\phi_1 \approx w\phi\,\nabla_\perp(2k^{-1}\nabla_\perp\Phi/c^2)$, which is the
change in the deflection angle across $w\phi$. If we replace the gradients by the inverse
of the typical scale, $k$, we have $\delta\phi_1 \approx 2w\phi k\Phi/c^2$. Along a
distance $w$, there are $N \approx kw$ such potential fluctuations, so that the total
change in angular separation is expected to be
$\delta\phi \approx N^{1/2}\,\delta\phi_1$.
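
The $N^{1/2}$ scaling of the accumulated change is the usual random-walk result. As a
minimal illustration (not taken from the original text), the following sketch adds $N$
independent kicks of equal size $\delta\phi_1$ and random sign, and compares the
resulting rms with $N^{1/2}\,\delta\phi_1$. Modelling each potential fluctuation as a
kick of fixed magnitude and random sign is an assumption made here for simplicity, as
are the function name and the chosen numbers.

```python
import math
import random


def rms_accumulated_kick(n_steps, kick, n_trials=2000, seed=1):
    """Rms of the sum of n_steps kicks of size `kick` with random signs."""
    rng = random.Random(seed)
    sum_sq = 0.0
    for _ in range(n_trials):
        total = sum(kick if rng.random() < 0.5 else -kick
                    for _ in range(n_steps))
        sum_sq += total * total
    return math.sqrt(sum_sq / n_trials)


if __name__ == "__main__":
    kick = 1.0e-5   # hypothetical change per fluctuation, arbitrary units
    for n_steps in (100, 400, 1600):
        measured = rms_accumulated_kick(n_steps, kick)
        expected = math.sqrt(n_steps) * kick
        print(f"N = {n_steps:5d}  rms = {measured:.3e}  "
              f"sqrt(N)*kick = {expected:.3e}")
```

Quadrupling the number of fluctuations doubles the rms, which is the behaviour assumed
in the estimate above.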

In case of incoherent scattering, the total deflection of each light path is expected
to be $\delta\phi \approx N^{1/2}\,(2k^{-1}\nabla_\perp\Phi/c^2) \approx
N^{1/2}\,2\Phi/c^2$, independent of $\phi$. Therefore,

\[
  \sigma^2(\phi) \approx
  \begin{cases}
    N\,\delta\phi_1^2 \approx (2\Phi/c^2)^2\,(wk)^3\,\phi^2
      & \text{for } \phi < (wk)^{-1} \\
    N\,(2\Phi/c^2)^2 \approx (2\Phi/c^2)^2\,(wk)
      & \text{for } \phi > (wk)^{-1}
  \end{cases}
  \tag{9.40}
\]

This illustrates that the dependence of $\sigma$ on $(wk)^{3/2}$ for small $\phi$ is
merely a consequence of the random coherent scattering of neighbouring light rays at
potential fluctuations. For large $\phi$, $\sigma(\phi)$ becomes constant, and so
$\sigma(\phi)\,\phi^{-1} \to 0$. As Fig. 33 shows, the dispersion $\sigma(\phi)$
increases linearly with $\phi$ for small $\phi$ and flattens gradually for
$\phi > \phi_g \approx (10-20)'$ as expected, because $\phi_g$ divides coherent from
incoherent scattering.

9.5.4 Numerical Results

The previous results were obtained by specialising to linear evolution of the density
contrast in an Einstein-de Sitter universe. For arbitrary cosmological parameters, the
deflection-angle dispersion has to be computed numerically. We show in Fig. 33 examples
for $\sigma(\phi)$ numerically calculated for the four cosmological models detailed in
Tab. 1 on page 117. Two curves are plotted for each model. The somewhat steeper curves
were obtained for linear, the others for non-linear density evolution.

Figure 33 shows that typical values for the deflection-angle dispersion in
cluster-normalised model universes are of order $\sigma(\phi) \approx (0.03-0.1)'$ on
angular scales between $\phi \approx (1-10)'$. While the results for different
cosmological parameters are fairly close for cluster-normalised CDM, $\sigma(\phi)$ is
larger by about a factor of two for CDM in an Einstein-de Sitter model normalised to
$\sigma_8 = 1$. For the other cosmological models, the differences between different
choices for the normalisation are less pronounced. The curves shown in Fig. 33 confirm
the qualitative behaviour estimated in the previous section: the dispersion
$\sigma(\phi)$ increases approximately linearly with $\phi$ as long as $\phi$ is small,
and it gradually flattens off at angular scales $\phi \gtrsim \phi_g \approx 20'$.

Fig. 33. The deflection-angle dispersion $\sigma(\phi)$ is shown for the four
cosmological models specified in Tab. 1 on page 117. Two curves are shown for each
model, one for linear and one for non-linear evolution of the density fluctuations.
Solid curves: SCDM; dotted curves: $\sigma$CDM; short-dashed curves: OCDM; and
long-dashed curves: $\Lambda$CDM. The somewhat steeper curves are for linear density
evolution. Generally, the deflection-angle dispersion increases linearly with $\phi$ for
small $\phi$, and flattens gradually for $\phi > 20'$. At $\phi \approx 10'$,
$\sigma(\phi)$ reaches $\approx 0.1'$, or $\approx 0.01\,\phi$, for the
cluster-normalised model universes (all except $\sigma$CDM; dotted curves). As expected,
the effect of non-linear density evolution is fairly moderate, and most pronounced on
small angular scales, $\phi \lesssim 10'$.

In earlier chapters, we saw that non-linear density evolution has a large impact on
weak gravitational lensing effects, e.g. on the magnification auto-correlation function
$\xi_\mu(\phi)$. As mentioned before, this is not the case for the deflection-angle
auto-correlation function $\xi_\alpha(\phi)$ and the dispersion $\sigma(\phi)$ derived
from it, because the filter function $F(l,\phi)$ relevant here suppresses small-scale
density fluctuations, for which the effects of non-linear evolution are strongest.
Therefore, non-linear evolution is expected to have less impact here. Only on small
angular scales $\phi$ does the filter function extend into the sufficiently non-linear
regime. The curves in Fig. 33 confirm and quantify this expectation. Only on scales of
$\phi \lesssim 10'$ does the non-linear evolution have some effect. Evidently,
non-linear evolution increases the deflection-angle dispersion in a manner quite
independent of cosmology. At angular scales $\phi \sim 1'$, the increase amounts to
roughly a factor of two above the linear results.

9.6 Change of CMB Temperature Fluctuations

9.6.1 Summary of Previous Results

We are now ready to justify assumptions and approximations made earlier, and to
quantify the impact of weak gravitational lensing on the Cosmic Microwave Background.
The main assumptions were that (i) the deflection-angle dispersion $\sigma(\phi)$ is
small, and (ii) $\sigma(\phi) \approx \epsilon\phi$, with $\epsilon$ a (small) constant
or a function slowly varying with $\phi$. The results obtained in the previous section
show that $\sigma(\phi)$ is typically about two orders of magnitude smaller than $\phi$,
confirming $\epsilon \ll 1$. Likewise, Fig. 33 shows that the assumption
$\sigma(\phi) \propto \phi$ is valid on angular scales smaller than the coherence scale
for the deflection, $\phi \lesssim \phi_g \approx 20'$. As we have seen, this
proportionality is a mere consequence of random coherent scattering of neighbouring
light rays in the fluctuating potential field. For angles larger than $\phi_g$,
$\sigma(\phi)$ gradually levels off to become constant, so that the ratio between
$\sigma(\phi)$ and $\phi$ tends to zero while $\phi$ increases further beyond $\phi_g$.
We can thus broadly summarise the numerical results on the deflection-angle dispersion
by

\[
  \sigma(\phi)
  \begin{cases}
    \approx 0.01\,\phi & \text{for } \phi \lesssim 20' \\
    \approx \text{const.} \lesssim 1' & \text{for } \phi \gtrsim 20'
  \end{cases}
  \tag{9.41}
\]

which is valid for cluster-normalised CDM quite independent of the cosmological
model; in particular, $\sigma(\phi) \lesssim 1' \approx 3 \times 10^{-4}$ radians for
all $\phi$.

9.6.2 Simplifications

Accordingly, the argument of the exponential in eq. (9.16) is a truly small number.
Even for large $l \approx 10^3$, $l^2\sigma^2(\phi) \ll 1$. We can thus safely expand the
exponential into a power series, keeping only the lowest-order terms. Then, eq. (9.16)
simplifies to

\[
  \xi'_T(\phi) = \xi_T(\phi) - \frac{\sigma^2(\phi)}{2}
  \int_0^\infty \frac{l^3\,dl}{2\pi}\, P_T(l)\, J_0(l\phi) ,
  \tag{9.42}
\]

where we have used that the auto-correlation function $\xi_T(\phi)$ is the Fourier
transform of the power spectrum $P_T(l)$. Employing again the approximate relation
$J_0''(x) \approx -J_0(x)/2$ which holds for small $x$, we notice that

\[
  \int_0^\infty \frac{l^3\,dl}{2\pi}\, P_T(l)\, J_0(l\phi)
  \approx -2\,\frac{\partial^2 \xi_T(\phi)}{\partial\phi^2} .
  \tag{9.43}
\]
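
The size of the expansion parameter can be checked directly. Taking the upper bound
$\sigma \lesssim 1'$ quoted in Sect. 9.6.1 and assuming that the exponent in eq. (9.16)
scales as $l^2\sigma^2(\phi)$, the sketch below converts $\sigma$ to radians and
evaluates $l^2\sigma^2$ for a few multipoles. The helper name is illustrative.

```python
import math

RADIANS_PER_ARCMIN = math.pi / (180.0 * 60.0)


def expansion_parameter(l, sigma_arcmin):
    """Return l^2 sigma^2 with sigma converted from arc minutes to radians."""
    sigma = sigma_arcmin * RADIANS_PER_ARCMIN
    return (l * sigma) ** 2


if __name__ == "__main__":
    for l in (100, 500, 1000):
        value = expansion_parameter(l, 1.0)
        print(f"l = {l:5d}  l^2 sigma^2 = {value:.4f}")
```

Even with the upper bound on $\sigma$, the parameter stays below 0.1 at $l = 10^3$,
which is why truncating the power series after the lowest-order term is a reasonable
first approximation there.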
We can introduce a typical angular scale $\phi_c$ for the CMB temperature fluctuations
in the same manner as for light deflection in eq. (9.32). We define $\phi_c$ by

\[
  \phi_c^{-2} \equiv -\frac{1}{\xi_T(0)}
  \left. \frac{\partial^2 \xi_T(\phi)}{\partial\phi^2} \right|_{\phi=0} ,
  \tag{9.44}
\]

so that, up to second order in $\phi$, eq. (9.42) can be approximated as

\[
  \xi'_T(\phi) \approx \xi_T(\phi) - \frac{\sigma^2(\phi)}{\phi_c^2}\,\xi_T(0) .
  \tag{9.45}
\]

We saw earlier that $\sigma(\phi) \approx \epsilon\phi$ for $\phi \lesssim \phi_g$.
Equation (9.45) can then further be simplified to read

\[
  \xi'_T(\phi) \approx \xi_T(\phi) - \epsilon^2\,\xi_T(0)\,\frac{\phi^2}{\phi_c^2} .
  \tag{9.46}
\]

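In analogy to eq. (9.26), we can write the mean-square temperature fluctuations of
the CMB between two beams separated by an angle $\phi$ as

\[
  \sigma_T^2(\phi) \equiv \left\langle \left[ \tau(\boldsymbol{\theta})
  - \tau(\boldsymbol{\theta}+\boldsymbol{\phi}) \right]^2 \right\rangle
  = 2\left[ \xi_T(0) - \xi_T(\phi) \right] .
  \tag{9.47}
\]

Weak gravitational lensing changes this relative variance to

\[
  \sigma_T^{\prime 2}(\phi) = 2\left[ \xi'_T(0) - \xi'_T(\phi) \right] .
  \tag{9.48}
\]

Using eq. (9.46), we see that the relative variance is increased by the amount

\[
  \Delta\sigma_T^2(\phi) \equiv \sigma_T^{\prime 2}(\phi) - \sigma_T^2(\phi)
  \approx 2\,\epsilon^2\,\xi_T(0)\,\frac{\phi^2}{\phi_c^2} .
  \tag{9.49}
\]

Now, the auto-correlation function at zero lag, $\xi_T(0)$, is the
temperature-fluctuation variance, $\sigma_T^2$. Hence, we have for the rms change in
the temperature variation

\[
  \left[ \Delta\sigma_T^2(\phi) \right]^{1/2}
  \approx \sqrt{2}\,\epsilon\,\sigma_T\,\frac{\phi}{\phi_c} .
  \tag{9.50}
\]

Weak gravitational lensing thus changes the CMB temperature fluctuations only by
a very small amount, of relative order $\epsilon \approx 10^{-2}$ for
$\phi \approx \phi_c$.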
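
For reference, the step from eqs. (9.46) to (9.48) to the change in the relative
variance can be written out explicitly. This only restates the relations above, using
the fact that the correction term in eq. (9.46) vanishes at $\phi = 0$, so that
$\xi'_T(0) = \xi_T(0)$:

```latex
\begin{align}
  \sigma_T^{\prime 2}(\phi)
    &= 2\left[ \xi'_T(0) - \xi'_T(\phi) \right] \\
    &\approx 2\left[ \xi_T(0) - \xi_T(\phi)
       + \epsilon^2\,\xi_T(0)\,\frac{\phi^2}{\phi_c^2} \right] \\
    &= \sigma_T^2(\phi) + 2\,\epsilon^2\,\xi_T(0)\,\frac{\phi^2}{\phi_c^2} .
\end{align}
```

The increase is therefore of second order in the small parameter $\epsilon$ and grows
quadratically with $\phi/\phi_c$ within the range $\phi \lesssim \phi_g$ where
$\sigma(\phi) \approx \epsilon\phi$ holds.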



9.6.3 The Lensed CMB Power Spectrum 

However, we saw in eq. (9.23) that the gravitationally lensed CMB power spectrum
is smoothed compared to the intrinsic power spectrum. Modes on an angular scale
$\phi$ are mixed with modes on angular scales $\phi \pm \sigma(\phi)$, i.e. the relative
broadening $\delta\phi/\phi$ is of order $2\sigma(\phi)/\phi$. For
$\phi \lesssim \phi_g \approx 20'$, this relative broadening is of order
$2\epsilon \approx 2 \times 10^{-2}$, while it becomes negligible for substantially
larger scales because $\sigma(\phi)$ becomes constant. This effect is illustrated in
Fig. 34, where we show the unlensed and lensed CMB power spectra for CDM in an
Einstein-de Sitter universe.




Fig. 34. The CMB power spectrum coefficients $l(l+1)C_l$ are shown as a function of
$l$. The solid line displays the intrinsic power spectrum, the dotted line the lensed
power spectrum for an Einstein-de Sitter universe filled with cold dark matter.
Evidently, lensing smooths the spectrum at small angular scales (large $l$), while it
has no visible effect on larger scales. The curves were produced with the CMBfast code,
see Zaldarriaga & Seljak (1998a).



The figure clearly shows that lensing smooths the CMB power spectrum on
small angular scales (large $l$), while it leaves large angular scales unaffected.

Lensing effects become visible at $l \gtrsim 500$, corresponding to an angular scale of
$\phi \lesssim (\pi/500)\,\mathrm{rad} \approx 20'$, which is the scale where coherent
gravitational light deflection sets in. An important effect of lensing is seen at the
high-$l$ tail of the power spectra, where the lensed power spectrum falls systematically
above the unlensed one (Metcalf & Silk 1997). This happens because the Gaussian
convolution kernel in eq. (9.23) becomes very broad for very large $l$, so that the
lensed power spectrum at $l'$ can be substantially increased by intrinsic power from
significantly smaller $l$. In other words, lensing mixes power from larger angular
scales into the otherwise featureless damping tail of $P_T(l)$.
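
The two qualitative effects described above, reduced contrast of features and a raised
damping tail, can be reproduced with a toy calculation. The sketch below is not the
convolution of eq. (9.23): it simply averages a hypothetical spectrum over a normalised
Gaussian in $l$ whose width is assumed to be a fixed fraction of $l$. The exponential
tail, the wiggle period, the amplitudes and all function names are assumptions chosen
for illustration only.

```python
import math


def damping_tail(l, l_damp=400.0):
    """Toy intrinsic spectrum: a featureless exponential damping tail."""
    return math.exp(-l / l_damp)


def wiggly_spectrum(l, l_damp=400.0, l_osc=300.0, amp=0.3):
    """The same tail, modulated by acoustic-like oscillations."""
    wiggle = 1.0 + amp * math.cos(2.0 * math.pi * l / l_osc)
    return damping_tail(l, l_damp) * wiggle


def gaussian_smooth(spectrum, l, rel_width=0.02, n=2001, span=6.0):
    """Average `spectrum` over a normalised Gaussian centred on l.

    The width rel_width * l is a toy stand-in for the lensing-induced
    broadening; it is not the kernel of eq. (9.23).
    """
    width = rel_width * l
    weighted_sum = 0.0
    weight_total = 0.0
    for i in range(n):
        offset = span * width * (2.0 * i / (n - 1) - 1.0)
        weight = math.exp(-0.5 * (offset / width) ** 2)
        weighted_sum += weight * spectrum(l + offset)
        weight_total += weight
    return weighted_sum / weight_total


def peak_to_trough(spectrum_at, l_peak=3000.0, l_trough=3150.0):
    """Ratio of envelope-normalised power at a wiggle peak and the next trough."""
    peak = spectrum_at(l_peak) / damping_tail(l_peak)
    trough = spectrum_at(l_trough) / damping_tail(l_trough)
    return peak / trough


if __name__ == "__main__":
    l = 3000.0
    for rel_width in (0.02, 0.1):
        ratio = gaussian_smooth(damping_tail, l, rel_width) / damping_tail(l)
        print(f"relative width {rel_width}: smoothed/intrinsic tail = {ratio:.3f}")
    smoothed = lambda x: gaussian_smooth(wiggly_spectrum, x)
    print(f"peak/trough intrinsic: {peak_to_trough(wiggly_spectrum):.2f}")
    print(f"peak/trough smoothed:  {peak_to_trough(smoothed):.2f}")
```

Because the tail is a convex, steeply falling function of $l$, any symmetric smoothing
draws in more power from smaller $l$ than it loses to larger $l$, and the effect grows
with the kernel width. The same averaging reduces the peak-to-trough contrast of the
oscillations.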
9.7 Discussion



Several different approximations entered the preceding derivations. Firstly, the
deflection-angle dispersion $\sigma(\phi)$ was generally assumed to be small, and for
some expressions to be proportional to $\phi$ with a small constant of proportionality
$\epsilon$. The numerical results showed that the first assumption is very well
satisfied, and the second assumption is valid for $\phi \lesssim \phi_g$, the latter
being the coherence scale of gravitational light deflection.

We further assumed the deflection-angle field to be a Gaussian random field, the
justification being that the deflecting matter distribution is also a Gaussian random
field. While this fails to be exactly true at late stages of the cosmic evolution, we
have seen that the resulting expression can also be obtained when $\sigma(\phi)$ is
small and $\alpha$ is not a Gaussian random field; hence, in practice this assumption
does not limit the validity of the results.

The final approximation is the Born approximation. This should also be a reasonable
assumption at least in the case considered here, where we focus on statistical
properties of light propagation. Even if the light rays were bent considerably, the
statistical properties of the potential gradient along their true trajectories would be
the same as along the approximated unperturbed rays.

Having found all the assumptions made well justifiable, we can conclude that the
random walk of light rays towards the surface of recombination leads to smoothing
of small-scale features in the CMB, while large-scale features remain unaffected.
The border line between small and large angular scales is determined by the angular
coherence scale of gravitational light deflection by large-scale matter distributions,
which we found to be of order $\phi_g \approx 20'$, corresponding to
$l_g = 2\pi\phi_g^{-1} \approx 1{,}000$. For the smallest angular scales, well into the
damping tail of the intrinsic CMB power spectrum, this smoothing leads to a substantial
re-distribution of power, which causes the lensed CMB power spectrum to fall
systematically above the unlensed one at $l \gtrsim 2000$, or
$\phi \lesssim 2\pi l^{-1} \approx 10'$. Future space-bound CMB observations, e.g. by
the Planck Surveyor satellite, will achieve angular resolutions of order
$\gtrsim 5'$, so that the lensed regime of the CMB power spectrum will be well
accessible. Highly accurate analyses of the data of such missions will therefore need
to take lensing effects by large-scale structures into account.
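
The correspondence between angular scale and multipole order quoted in this section is
a rough one. The conversions here use $\phi \approx 2\pi/l$, whereas Sect. 9.6.3 uses
$\phi \approx \pi/l$. The following sketch exposes the prefactor as a parameter so that
both conventions can be evaluated; the function names are illustrative.

```python
import math

ARCMIN_PER_RADIAN = 180.0 * 60.0 / math.pi


def multipole_from_angle(phi_arcmin, factor=2.0 * math.pi):
    """Multipole l = factor / phi for an angular scale phi in arc minutes."""
    return factor * ARCMIN_PER_RADIAN / phi_arcmin


def angle_from_multipole(l, factor=2.0 * math.pi):
    """Angular scale phi = factor / l, returned in arc minutes."""
    return factor / l * ARCMIN_PER_RADIAN


if __name__ == "__main__":
    print(f"phi_g = 20'  ->  l_g = {multipole_from_angle(20.0):.0f}")
    print(f"l = 2000     ->  phi = {angle_from_multipole(2000.0):.1f}'")
    print(f"l = 500 with phi = pi/l  ->  "
          f"phi = {angle_from_multipole(500.0, factor=math.pi):.1f}'")
```

With the $2\pi/l$ convention, $\phi_g \approx 20'$ maps to $l_g$ slightly above 1,000
and $l = 2000$ to slightly more than $10'$, in line with the rounded values in the text.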

One of the foremost goals of CMB observations is to derive cosmological parameters
from the angular CMB power spectrum $C_l$. Unfortunately, there exists a parameter
degeneracy in the sense that for any given set of cosmological parameters fitting a
given CMB spectrum, a whole family of cosmological models can be found that will fit
the spectrum (almost) equally well (Zaldarriaga et al. 1997). Metcalf & Silk (1998)
showed that the rise in the damping-tail amplitude due to gravitational lensing of the
CMB can be used to break this degeneracy once CMB observations with sufficiently high
angular resolution become available.
We discussed in Sect. 4.2 how shapes of galaxy images can be quantified with the
tensor $Q_{ij}$ of second surface-brightness moments. Techniques for the reconstruction
of the intervening projected matter distribution are then based on (complex)
ellipticities constructed from $Q_{ij}$, e.g. the quantity $\chi$ defined in (4.4).
Similar reconstruction techniques can be developed by constructing quantities
comparable to $\chi$ from the CMB temperature fluctuations
$\tau(\boldsymbol{\theta})$. Two such quantities were suggested in the literature,
namely

\[
  \tau_{,1}^2 - \tau_{,2}^2 + 2\mathrm{i}\,\tau_{,1}\tau_{,2}
  \tag{9.51}
\]

(Zaldarriaga & Seljak 1998b) and

\[
  \tau_{,11} - \tau_{,22} + 2\mathrm{i}\,\tau_{,12}
  \tag{9.52}
\]

(Bernardeau 1997). As usual, comma-preceded indices $i$ denote differentiation with
respect to $\theta_i$.
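
To make the two combinations concrete, the following sketch evaluates them at a single
position from a temperature map supplied as a function $\tau(\theta_1, \theta_2)$, using
central finite differences. It only illustrates the definitions in eqs. (9.51) and
(9.52) and is not the estimator of either cited paper. The step size, the function
names and the quadratic test map are assumptions made for the example.

```python
def first_derivatives(tau, t1, t2, h=1.0e-3):
    """Central-difference estimates of tau_,1 and tau_,2 at (t1, t2)."""
    d1 = (tau(t1 + h, t2) - tau(t1 - h, t2)) / (2.0 * h)
    d2 = (tau(t1, t2 + h) - tau(t1, t2 - h)) / (2.0 * h)
    return d1, d2


def second_derivatives(tau, t1, t2, h=1.0e-3):
    """Central-difference estimates of tau_,11, tau_,22 and tau_,12."""
    centre = tau(t1, t2)
    d11 = (tau(t1 + h, t2) - 2.0 * centre + tau(t1 - h, t2)) / h ** 2
    d22 = (tau(t1, t2 + h) - 2.0 * centre + tau(t1, t2 - h)) / h ** 2
    d12 = (tau(t1 + h, t2 + h) - tau(t1 + h, t2 - h)
           - tau(t1 - h, t2 + h) + tau(t1 - h, t2 - h)) / (4.0 * h ** 2)
    return d11, d22, d12


def gradient_quantity(tau, t1, t2, h=1.0e-3):
    """Complex combination of first derivatives, eq. (9.51)."""
    d1, d2 = first_derivatives(tau, t1, t2, h)
    return complex(d1 ** 2 - d2 ** 2, 2.0 * d1 * d2)


def curvature_quantity(tau, t1, t2, h=1.0e-3):
    """Complex combination of second derivatives, eq. (9.52)."""
    d11, d22, d12 = second_derivatives(tau, t1, t2, h)
    return complex(d11 - d22, 2.0 * d12)


def example_map(t1, t2):
    """Hypothetical smooth map: tau = t1^2 + 3 t1 t2."""
    return t1 * t1 + 3.0 * t1 * t2


if __name__ == "__main__":
    print(gradient_quantity(example_map, 1.0, 2.0))
    print(curvature_quantity(example_map, 1.0, 2.0))
```

For the quadratic test map the finite differences are exact up to rounding, so the
results can be compared with the analytic derivatives $\tau_{,1} = 2\theta_1 +
3\theta_2$, $\tau_{,2} = 3\theta_1$, $\tau_{,11} = 2$, $\tau_{,22} = 0$ and
$\tau_{,12} = 3$. On a real pixelised map the derivatives would be taken between
neighbouring pixels and would be affected by noise.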

Finally, it is worth noting that gravitational lensing mixes different types of CMB
polarisation (the "electric" and "magnetic", or E and B modes, respectively) and
can thus create B-type polarisation even when only E-type polarisation is intrinsically
present (Zaldarriaga & Seljak 1998a). This effect, however, is fairly small in
typical cosmological models and will only marginally affect future CMB polarisation
measurements.
10 Summary and Outlook



We have summarised the basic ideas, theoretical developments, and first applica- 
tions of weak gravitational lensing. In particular, we showed how the projected 
mass distribution of clusters can be reconstructed from the image distortion of 
background galaxies, using parameter-free methods, how the statistical mass distri- 
bution of galaxies can be obtained from galaxy-galaxy lensing, and how the larger- 
scale mass distribution in the Universe affects observations of galaxy shapes and 
fluxes of background sources, as well as the statistical properties of the CMB. Fur- 
thermore, weak lensing can be used to construct a mass-selected sample of clusters 
of galaxies, making use only of their tidal gravitational field which leaves an im- 
print on the image shapes of background galaxies. We have also discussed how the 
redshift distribution of these faint and distant galaxies can be derived from lens- 
ing itself, well beyond the magnitude limit which is currently available through 
spectroscopy. 

Given that the first coherent image alignment of faint galaxies around foreground
clusters was discovered only a decade ago (Fort et al. 1988; Tyson et al. 1990),
the field of weak lensing has undergone a rapid evolution in the last few years,
for three main reasons. (i) Theoreticians have recognised the potential power of
this new tool for observational cosmology, and have developed specific statistical
methods for extracting astrophysically and cosmologically relevant information from
astronomical images. (ii) Parallel to that effort, observers have developed new
observing strategies and image analysis software in order to minimise the influence of
instrumental artefacts on the measured properties of faint images, and to control as
much as possible the point-spread function of the resulting image. It is interesting to
note that several image analysis methods, particularly aimed at shape measurements of
very faint galaxies for weak gravitational lensing, have been developed by a coherent
effort of theoreticians and observers (Bonnet & Mellier 1995; Kaiser et al. 1995;
Luppino & Kaiser 1997; Van Waerbeke et al. 1997; Kaiser 1999; Rhodes et al. 1999;
Kuijken 1999), indicating the need for a close interaction between these two groups
which is imposed by the research subject.

(iii) The third and perhaps major reason for the rapid evolution is the instrumental
development that we are witnessing. Most spectacular was the refurbishment of
the Hubble Space Telescope (HST) in Dec. 1993, after which this telescope produced
astronomical images of angular resolution unprecedented in optical astronomy. These
images have not only been of extreme importance for studying multiple images of
galaxy-scale lens systems (where the angular separation is of order one arc second) and
for detailed investigations of giant arcs and multiple galaxy images in clusters of
galaxies, but also for several of the most interesting results of weak lensing. Owing
to the lack of atmospheric smearing and the reduced sky background from space, the
shape of fainter and smaller galaxy images can be measured on HST images, increasing
the useful number density of background galaxies, and thus reducing the noise due to
the intrinsic ellipticity distribution. Two of the most detailed mass maps of clusters
have been derived from HST data (Seitz et al. 1996; Hoekstra et al. 1998), and all but
one of the published results on galaxy-galaxy lensing are based on data taken with the
HST. In parallel to this, the development of astronomical detectors has progressed
quickly. The first weak-lensing observations were carried out with CCD detectors of
$\sim 1{,}000^2$ pixels, covering a fairly small field-of-view. A few years ago, the
first $(8\mathrm{K})^2$ camera was used for astronomical imaging. Its $30' \times 30'$
field can be used to map the mass distribution of clusters at large cluster-centric
radii, to investigate the potential presence of filaments between neighbouring clusters
(Kaiser et al. 1998), or simply to obtain high-quality data on a large area. Such data
will be useful for galaxy-galaxy lensing, the search for haloes using their lensing
properties only, for the investigation of cosmic shear, and for homogeneous galaxy
number counts on large fields, needed to obtain a better quantification of the
statistical association of AGNs with foreground galaxies.

It is easy to foresee that the instrumental developments will remain the driving 
force for this research field. By now, several large-format CCD cameras are ei- 
ther being built or already installed, including three cameras with a one square 
degree field-of-view and adequate sampling of the PSF (MEGAPRIME at CFHT, 
MEGACAM at the refurbished MMT, and OMEGACAM at the newly built VLT 
Support Telescope at Paranal; see the recent account of wide-field imaging
instruments in Arnaboldi et al. 1998). Within a few years, more than a dozen 8- to
10-meter telescopes will be operating, and many of them will be extremely useful for
obtaining high-quality astronomical images, due to their sensitivity, their imaging 
properties and the high quality of the astronomical site. In fact, at least one of them 
(SUBARU on Mauna Kea) will be equipped with a large-format CCD camera. One 
might hypothesise that weak gravitational lensing is one of the main science drivers 
to shift the emphasis of optical astronomers more towards imaging, in contrast to 
spectroscopy. For example, the VLT Support Telescope will be fully dedicated to
imaging, and the fraction of time for wide-field imaging on several other major
telescopes will be substantial. The Advanced Camera for Surveys (ACS) is planned to
be installed on the HST in 2001. Its larger field-of-view, better sampling, and higher
quantum efficiency, compared to the current imaging camera WFPC2, promise
to be particularly useful for weak lensing observations.

Even more ambitious ground-based imaging projects are currently under discussion.
Funding has been secured for the VISTA project (see
http://www-star.qmw.ac.uk/~jpe/vista/) of a 4 m telescope in Chile with a
field-of-view of at least one square degree. Another 4 m Dark Matter Telescope with a
substantially larger field-of-view (nine square degrees) is being discussed
specifically for weak lensing. Kaiser et al. (1999) proposed a new strategy for deep,
wide-field optical imaging at high angular resolution, based on an array of relatively
small ($D \sim 1.5$ m) telescopes with fast guiding capacity and a "rubber" focal
plane.

Associated with this instrumental progress is the evolution of data-analysis ca- 
pabilities. Whereas a small-format CCD image can be reduced and analysed 'by 
hand', this is no longer true for the large-format CCD images. Semi-automatic 
data-reduction pipelines will become necessary to keep up with the data flow. These 
pipelines, once properly developed and tested, can lead to a more 'objective' data 
analysis. In addition, specialised software, such as for the measurement of shapes 
of faint galaxies, can be implemented, together with tools which allow a correction 
for PSF anisotropies and smearing.

Staying with instrumental developments for one more moment, the two planned 
CMB satellite missions (MAP and Planck Surveyor) will provide maps of the CMB 
at an angular resolution and a signal-to-noise ratio which will most likely lead to the 
detection of lensing by the large-scale structure on the CMB, as described in Sect. 9. 
Last but not least, the currently planned Next Generation Space Telescope (NGST, 
Kaldeich 1999), with a projected launch date of 2008, will provide a giant step in 
many fields of observational astronomy, not the least for weak lensing. It combines 
a large aperture (of order eight meters) with a position far from Earth to reduce sky 
background and with large-format imaging cameras. Even a relatively short expo- 
sure with the NGST, which will be optimised for observations in the near-infrared, 
will return images with a number density of several hundred background galaxies 
per square arc minute, for which a shape can be reliably measured; more accurate 
estimates are presently not feasible due to the large extrapolation into unknown 
territory. Comparing this number with the currently achievable number density in 
ground-based observations of about 30 per square arc minute, NGST will revolutionise
this field. (Whereas with the 8- to 10-meter class ground-based telescopes deeper
images can be obtained, this does not drastically affect the 'useful' number density of
faint galaxy images. Since fainter galaxies also tend to become smaller, and since a
reliable shape estimate of a galaxy is feasible only if its size is not much smaller
than the size of the seeing disk, very much deeper images from the ground will not
yield much larger number densities of galaxy images which can be used for weak
lensing.) In addition, the corresponding galaxies will be at much higher
mean redshift than currently observable galaxy samples. Taken together, these two
facts imply that one can detect massive haloes at medium redshifts with only half
the velocity dispersion currently necessary to detect them with ground-based data,
or that the investigations of the mass distribution of haloes can be extended to much
higher redshifts than currently possible (see Schneider & Kneib 1998). The ACS
on board HST will provide an encouraging hint of the increase in capabilities that
NGST has to offer.

Progress may also come from somewhat unexpected directions. Whereas the
Sloan Digital Sky Survey (SDSS; e.g. Szalay 1998) will be very shallow compared to
more standard weak-lensing observations, its huge angular coverage
may compensate for it (Stebbins 1996). The VLA-FIRST survey of radio sources
(White et al. 1997) suffers from the sparsely populated radio sky, but this is also
compensated by the huge sky coverage (Refregier et al. 1998). The use of both surveys
for weak lensing will depend critically on the level down to which the systematics of
the instrumental image distortion can be understood and compensated
for.
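
The trade-off between number density and sky coverage invoked above can be illustrated
with the usual shape-noise scaling. The sketch below assumes that the statistical error
of a mean shear estimate from $N$ galaxies is $\sigma_\epsilon/\sqrt{N}$, up to factors
of order unity that depend on the ellipticity definition, with a hypothetical intrinsic
ellipticity dispersion $\sigma_\epsilon = 0.3$. It further assumes that the
signal-to-noise of a halo detection scales as $\sigma_v^2\sqrt{n}$, as for a singular
isothermal sphere at fixed lens and source redshifts. Both scalings and the value of 300
galaxies per square arc minute (standing in for "several hundred") are assumptions of
this sketch, not statements from the text.

```python
import math


def shear_noise(number_density, area, sigma_e=0.3):
    """Shape-noise error of a mean shear from n * A galaxies.

    number_density is per square arc minute, area in square arc minutes;
    sigma_e is a hypothetical intrinsic ellipticity dispersion.
    """
    return sigma_e / math.sqrt(number_density * area)


def min_velocity_dispersion_ratio(n_new, n_old):
    """Ratio of detectable velocity dispersions, assuming S/N ~ sigma_v^2 sqrt(n)."""
    return (n_old / n_new) ** 0.25


if __name__ == "__main__":
    # Deeper data: ten times the number density on the same area.
    print(f"noise ratio 30 -> 300 per arcmin^2: "
          f"{shear_noise(30.0, 1.0) / shear_noise(300.0, 1.0):.2f}")
    # Shallow but wide: one tenth of the density on 100 times the area.
    print(f"noise ratio deep/small vs shallow/wide: "
          f"{shear_noise(30.0, 1.0) / shear_noise(3.0, 100.0):.2f}")
    print(f"sigma_v ratio from density alone: "
          f"{min_velocity_dispersion_ratio(300.0, 30.0):.2f}")
```

Under these assumptions, the higher number density alone lowers the detectable velocity
dispersion to somewhat more than half its present value; the higher mean source redshift
mentioned in the text would reduce it further.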

Gravitational lensing has developed from a stand-alone research field into a versa- 
tile tool for observational cosmology, and this also applies to weak lensing. But, 
whereas the usefulness of strong lensing is widely accepted by the astronomical 
community, weak lensing is only beginning to reach that level of wide apprecia- 
tion. Part of this difference in attitude may be due to the fact that strong-lensing 
effects, such as multiple images and giant arcs, can easily be seen on CCD images, 
and their interpretation can readily be explained also to the non-expert. In contrast, 
weak lensing effects are revealed only through thorough statistical analysis of the 
data. Furthermore, the number of people working on weak lensing on the level of 
data analysis is still quite small, and the methods used to extract shear from CCD 
data are rather intricate. The analysis of CMB data is certainly more complicated
than weak-lensing analyses, but more people work in that field, and they have
checked and cross-checked their results; more people also means that much
more development has gone into it. Therefore, what is needed in weak lensing
is a detailed comparison of methods, preferably by several independent groups,
analysing the same data sets, together with extensive work on simulated data to 
investigate down to which level a very weak shear can be extracted from them. Up 
to now, no show-stopper has been identified which prohibits the detection of shear 
at the sub-percent level. 

Weak-lensing results and techniques will increasingly be combined with other 
methods. A few examples may suffice to illustrate this point. The analysis of galaxy 
clusters with (weak) lensing will be combined with results from X-ray measure- 
ments of the clusters and their Sunyaev-Zel'dovich decrement. Once these meth- 
ods are better understood, in particular in terms of their systematics, the question 
will no longer be, "Are the masses derived with these methods in agreement?", 
but rather, "What can we learn from their comparison?" For instance, while lens- 
ing is insensitive to the distribution of matter along the line-of-sight, the X-ray 
emission is, and thus their combination provides information on the depth of the 
cluster (see, e.g., Zaroubi et al. 1998). One might expect that clusters will continue 
for some time to be main targets for weak-lensing studies. In addition to clusters 
selected by their emission, mass concentrations selected only by their weak-lensing 
properties shall be investigated in great detail, both with deeper images to obtain a 
more accurate measurement of the shear, and by X-ray, IR, sub-mm, and optical/IR 
multi-colour techniques. It would be spectacular, and of great cosmological signif- 
icance, to find mass concentrations of exceedingly high mass-to-light ratio (well in 
excess of 1,000 in solar units), and it is important to understand the distribution of 
M/L for clusters. A first example may have been found by Erben et al. (1999). 
As mentioned before, weak lensing is able to constrain the redshift distribution
of very faint objects which do not allow spectroscopic investigation. Thus, lensing
can constrain extrapolations of the z-distribution, and the models for the redshift
estimates obtained from multi-colour photometry ('photometric redshifts').
On the other hand, photometric redshifts will play an increasingly important role
for weak lensing, as they will allow the signal-to-noise ratio of local shear
measurements to be increased. Furthermore, if source galaxies at increasingly higher
redshifts are considered (as will be the case with the upcoming giant telescopes,
cf. Clowe et al. 1998), the probability increases that more than one deflector lies
between us and this distant screen of sources. To disentangle the corresponding
projection effects, the dependence of the lensing strength on the lens and source
redshift can be employed: lenses at different redshifts cause different source-redshift
dependences of the measured shear, which photometric redshifts make accessible.
Whereas a fully three-dimensional mass distribution will probably be difficult to
obtain using this relatively weak redshift dependence, a separation of the mass
distribution into a small number of lens planes appears feasible.
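
The source-redshift dependence referred to here enters through the distance ratio
$D_{ds}/D_s$, to which the shear of a fixed lens is proportional. As an illustration,
the sketch below evaluates this ratio in an Einstein-de Sitter universe, where the
angular-diameter distance between redshifts $z_1 < z_2$ is
$(2c/H_0)\,(1+z_2)^{-1}\,[(1+z_1)^{-1/2} - (1+z_2)^{-1/2}]$. The choice of the
Einstein-de Sitter model, the example redshifts and the function names are assumptions
made for simplicity.

```python
def eds_distance(z1, z2):
    """Angular-diameter distance from z1 to z2 > z1 (Einstein-de Sitter, c/H_0 = 1)."""
    return 2.0 / (1.0 + z2) * ((1.0 + z1) ** -0.5 - (1.0 + z2) ** -0.5)


def lensing_efficiency(z_lens, z_source):
    """Distance ratio D_ds / D_s; zero for sources in front of the lens."""
    if z_source <= z_lens:
        return 0.0
    return eds_distance(z_lens, z_source) / eds_distance(0.0, z_source)


if __name__ == "__main__":
    sources = (0.5, 1.0, 2.0, 3.0)
    for z_lens in (0.2, 0.6):
        # Normalise to the most distant source bin to compare the shapes.
        reference = lensing_efficiency(z_lens, sources[-1])
        row = ", ".join(f"{lensing_efficiency(z_lens, z) / reference:.2f}"
                        for z in sources)
        print(f"z_lens = {z_lens}: relative shear at z_s = {sources}: {row}")
```

The relative shear rises much more steeply with source redshift for the more distant
lens. With photometric redshifts for the sources, this difference in shape is what
allows the contributions of a small number of lens planes to be separated.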



Combining results from cosmic-shear measurements with the power spectrum of 
the cosmic density fluctuations as measured from the CMB will allow a sensitive 
test of the gravitational instability picture for structure formation. As was pointed 
out by Hu & Tegmark (1999), cosmic-shear measurements can substantially im- 
prove the accuracy of the determination of cosmological parameters from CMB 
experiments, in particular by breaking the degeneracies inherent in the latter (see 
also Metcalf & Silk 1998). The comparison between observed cosmic shear and 
theory will at least partly involve the increasingly detailed numerical simulations 
of cosmic structure evolution, from which predictions for lensing observations can 
directly be obtained. For example, if the dark matter haloes in the numerical simu- 
lations are populated with galaxies, e.g., by using semi-empirical theories of galaxy 
evolution (Kauffmann et al. 1997), detailed predictions for galaxy-galaxy lensing
can be derived and compared with observations, thus constraining these theories. 
The same numerical results will predict the relation between the measured shear 
and the galaxy distribution on larger scales, which can be compared with the observable
correlation between these quantities to investigate the scale and redshift
dependence of the bias factor.



The range of applications of weak lensing will grow in parallel with new instrumental
developments. Keeping in mind that many discoveries in gravitational lensing
were not really expected (like the existence of Einstein rings, or giant luminous
arcs), it seems likely that the introduction and extensive use of wide-field cameras 
and giant telescopes will give rise to real surprises. 






Acknowledgements 

We are deeply indebted to Lindsay King and Shude Mao for their very careful 
reading of the manuscript and their numerous constructive remarks. 






References 



Abell, G. O., 1958, ApJS, 3, 211 

Antonucci, R., 1993, ARA&A, 31, 473 

Arnaboldi, M., Capaccioli, M., Mancini, D., Rafanelli, P., Scaramella, R., & Sedmak, G.,
1998, in Wide-Field Surveys in Cosmology, 14th IAP meeting, Editions Frontieres

Babul, A. & Lee, M. H., 1991, MNRAS, 250, 407 

Bahcall, N. & Fan, X., 1998, ApJ, 504, 1 

Bahcall, N. A., 1977, ARA&A, 15, 505 

Bahcall, N. A., 1988, ARA&A, 26, 631 

Banday, A. J., Gorski, K. M., Bennett, C. L., Hinshaw, G., Kogut, A., Lineweaver, C.,
Smoot, G. F., & Tenorio, L., 1997, ApJ, 475, 393

Bardeen, J. M., 1980, PhRvD, 22, 1882 

Bardeen, J. M., Bond, J. R., Kaiser, N., & Szalay, A. S., 1986, ApJ, 304, 15 
Bartelmann, M., 1995a, A&A, 303, 643 
Bartelmann, M., 1995b, A&A, 298, 661 

Bartelmann, M., Ehlers, J., & Schneider, P., 1993, A&A, 280, 351

Bartelmann, M., Huss, A., Colberg, J. M., Jenkins, A., & Pearce, F. R., 1998, A&A, 330, 1

Bartelmann, M. & Narayan, R., 1995, ApJ, 451, 60

Bartelmann, M., Narayan, R., Seitz, S., & Schneider, P., 1996, ApJ, 464, L115

Bartelmann, M. & Schneider, P., 1991, A&A, 248, 349

Bartelmann, M. & Schneider, P., 1992, A&A, 259, 413

Bartelmann, M. & Schneider, P., 1993a, A&A, 268, 1

Bartelmann, M. & Schneider, P., 1993b, A&A, 271, 421

Bartelmann, M. & Schneider, P., 1994, A&A, 284, 1

Bartelmann, M. & Schneider, P., 1999, A&A, 345, 17

Bartelmann, M., Schneider, P., & Hasinger, G., 1994, A&A, 290, 399

Bartsch, A., Schneider, P., & Bartelmann, M., 1997, A&A, 319, 375

Begelman, M. C., Blandford, R. D., & Rees, M. J., 1984, Rev. Mod. Phys., 56, 255

Benitez, N. & Martinez-Gonzalez, E., 1995, ApJ, 448, L89






Bernardeau, F., 1997, A&A, 324, 15

Bernardeau, F., Van Waerbeke, L., & Mellier, Y., 1997, A&A, 322, 1
Bertin, E. & Arnouts, S., 1996, A&AS, 117, 393

Biggs, A., Browne, I., Helbing, P., Koopmans, L., Wilkinson, P., & Perley, R., 1998,
astro-ph/9811282

Binggeli, B., Sandage, A., & Tammann, G. A., 1988, ARA&A, 26, 509
Birkinshaw, M. & Gull, S. F., 1983, Nat, 302, 315
Blanchard, A. & Schneider, J., 1987, A&A, 184, 1 
Blandford, R. & Narayan, R., 1986, ApJ, 310, 568 

Blandford, R. D., Cohen, J., Kundic, T., Brainerd, T., & Hogg, D., 1998, in American 
Astronomical Society Meeting 192, #07.04 

Blandford, R. D. & Narayan, R., 1992, ARA&A, 30, 311 

Blandford, R. D., Netzer, H., & Woltjer, L., 1990, Active galactic nuclei, Springer Verlag, 
Berlin 

Blandford, R. D., Saust, A. B., Brainerd, T. G., & Villumsen, J. V., 1991, MNRAS, 251, 
600 

Bonnet, H., Fort, B., Kneib, J.-P., Mellier, Y., & Soucail, G., 1993, A&A, 280, L7

Bonnet, H. & Mellier, Y., 1995, A&A, 303, 331

Borgeest, U., v. Linde, J., & Refsdal, S., 1991, A&A, 251, L35

Börner, G., 1988, The early universe, Springer, Heidelberg

Börner, G. & Ehlers, J., 1988, A&A, 204, 1

Bouchet, F. R., Juszkiewicz, R., Colombi, S., & Pellat, R., 1992, ApJ, 394, L5

Bower, R. G. & Smail, I., 1997, MNRAS, 290, 292

Boyle, B. J., Shanks, T., & Peterson, B. A., 1988, MNRAS, 235, 935 

Brainerd, T. G., Blandford, R. D., & Smail, I., 1996, ApJ, 466, 623 

Brainerd, T. G., Smail, I., & Mould, J., 1995, MNRAS, 275, 781 

Bridle, S. L., Hobson, M. P., Lasenby, A. N., & Saunders, R., 1998, MNRAS, 299, 895

Broadhurst, T., 1995, preprint astro-ph/9511150 

Broadhurst, T., Ellis, R., Koo, D., & Szalay, A., 1990, Nat, 343, 726 

Broadhurst, T. J., Ellis, R. S., & Shanks, T., 1988, MNRAS, 235, 827

Broadhurst, T. J., Taylor, A. N., & Peacock, J. A., 1995, ApJ, 438, 49 






Buta, R., Mitra, S., De Vaucouleurs, G., & Corwin Jr., H. G., 1994, AJ, 107, 118 

Carlberg, R. G., Yee, H. K. C., & Ellingson, E., 1994, ApJ, 437, 63

Carroll, S. M., Press, W. H., & Turner, E. L., 1992, ARA&A, 30, 499 

Cayon, L., Martinez-Gonzalez, E., & Sanz, J. L., 1993a, ApJ, 413, 10 

Cayon, L., Martinez-Gonzalez, E., & Sanz, J. L., 1993b, ApJ, 403, 471 

Clowe, D., Luppino, G. A., Kaiser, N., Henry, J. P., & Gioia, I. M., 1998, ApJ, 497, L61

Cohen, J. G., Blandford, R. D., Hogg, D. W., Pahre, M. A., & Shopbell, P., 1999, ApJ, 512,
30

Cole, S. & Efstathiou, G., 1989, MNRAS, 239, 195 

Coles, P., Lucchin, F., Matarrese, S., & Moscardini, L., 1998, MNRAS, 300, 183

Colless, M., 1998, in S. Colombi, Y. Mellier, and B. Raban (eds.), Wide-Field Surveys in 
Cosmology, p. 77, Editions Frontieres 

Colless, M., Ellis, R. S., Broadhurst, T. J., Taylor, K., & Peterson, B. A., 1993, MNRAS,
261, 19 

Colless, M., Ellis, R. S., Shaw, G., & Taylor, K., 1991, MNRAS, 253, 686 

Connolly, A. J., Csabai, I., Szalay, A. S., Koo, D. C., Kron, R. G., & Munn, J. A., 1995,
AJ, 110, 2655 

Crampton, D., Le Fevre, O., Lilly, S. J., & Hammer, F., 1995, ApJ, 455, 96

Dalcanton, J. J., Canizares, C. R., Granados, A., Steidel, C. C., & Stocke, J. T., 1994, ApJ,
424, 550

Dashevskii, V. M. & Zel'dovich, Ya. B., 1965, SvA, 8, 854
Davis, M. & Peebles, P. J. E., 1983, ApJ, 267, 465

de Vaucouleurs, G., de Vaucouleurs, A., Corwin, J. R., Buta, R. J., Paturel, G., & Fouque, 
P., 1991, Third reference catalogue of bright galaxies, version 9, Springer Verlag, New 
York 

Deiser, N., 1995, Statistische Methoden zur Bestimmung des Linsenparameters,
Diploma-thesis, LMU München

Dell'Antonio, I. P. & Tyson, J. A., 1996, ApJ, 473, L17
Dolag, K. & Bartelmann, M., 1997, MNRAS, 291, 446

Donahue, M., Voit, G. M., Gioia, I., Luppino, G., Hughes, J. P., & Stocke, J. T., 1998, ApJ, 
502, 550 

Dressler, A. & Gunn, J. E., 1992, ApJS, 78, 1
Dye, S. & Taylor, A., 1998, MNRAS, 300, L23 






Dyer, C. C., 1976, MNRAS, 175, 429

Dyer, C. C. & Roeder, R. C., 1973, ApJ, 180, L31

Ebbels, T. et al., 1998, MNRAS, 295, 75 
Ebbels, T. M. D. et al., 1996, MNRAS, 281, L75 

Eddington, A. S., 1920, Space, time and gravitation, Cambridge University Press, 
Cambridge 

Efstathiou, G., 1996, p. 133, North-Holland, Elsevier 

Efstathiou, G., Ellis, R. S., & Peterson, B. A., 1988, MNRAS, 232, 431 

Einstein, A. & Strauss, E. G., 1945, Rev. Mod. Phys., 17, 20 

Eke, V. R., Cole, S., & Frenk, C. S., 1996, MNRAS, 282, 263 

Ellis, R. S., 1997, ARA&A, 35, 389

Erben, T., 1997, Die Bestimmung von Galaxieneigenschaften durch Galaxy-Galaxy-Lensing,
Diploma-thesis, TU München

Erben, T., van Waerbeke, L., Mellier, Y., et al., 1999, A&A, in press, preprint
astro-ph/9907134

Etherington, I. M. H., 1933, Phil. Mag., 15, 761 

Faber, S. M. & Gallagher, J. S., 1979, ARA&A, 17, 135 

Faber, S. M. & Jackson, R. E., 1976, ApJ, 204, 668 

Fahlman, G., Kaiser, N., Squires, G., & Woods, D., 1994, ApJ, 437, 56

Falco, E. E., Gorenstein, M. V., & Shapiro, I. I., 1985, ApJ, 289, L1

Falco, E. E., Kochanek, C. S., & Munoz, J. A., 1998, ApJ, 494, 47

Fan, X., Strauss, M. A., Schneider, D. P., Gunn, J. E., et al., 1999, AJ, 118, 1

Fischer, P., 1999, preprint, astro-ph/9901407

Fischer, P., Bernstein, G., Rhee, G., & Tyson, J. A., 1997, AJ, 113, 521
Fischer, P. et al., 1999, AJ, submitted, preprint astro-ph/9912119
Fischer, P. & Tyson, J. A., 1997, AJ, 114, 14

Fischer, P., Tyson, J. A., Bernstein, G. M., & Guhathakurta, P., 1994, ApJ, 431, L71

Fixsen, D. J., Cheng, E. S., Gales, J. M., Mather, J. C., Shafer, R. A., & Wright, E. L., 1996,
ApJ, 473, 576

Forman, W. & Jones, C., 1982, ARA&A, 20, 547
Fort, B. & Mellier, Y., 1994, A&AR, 5, 239






Fort, B., Mellier, Y., & Dantel-Fort, M., 1997, A&A, 321, 353

Fort, B., Mellier, Y., Dantel-Fort, M., Bonnet, H., & Kneib, J.-P., 1996, A&A, 310, 705
Fort, B., Prieur, J. L., Mathez, G., Mellier, Y., & Soucail, G., 1988, A&A, 200, L17

Freedman, W. L., 1996, in Critical dialogs in cosmology, proc. Princeton 250th
anniversary

Friedmann, A., 1922, Z. Phys., 10, 377 
Friedmann, A., 1924, Z. Phys., 21, 326 
Frieman, J. A., 1996, Comments on Astrophysics 
Fry, J. N., 1984, ApJ, 279, 499 
Frye, B. & Broadhurst, T., 1998, ApJ, 499, L115
Fugmann, W., 1990, A&A, 240, 11 

Fukugita, M., Futamase, T., Kasai, M., & Turner, E. L., 1992, ApJ, 393, 3 

Futamase, T., 1989, MNRAS, 237, 187 

Futamase, T. & Sasaki, M., 1989, Phys. Rev. D, 40, 2502 

Gautret, L., Fort, B., & Mellier, Y., 1998, A&A, submitted, preprint astro-ph/9812388

Geiger, B. & Schneider, P., 1998, MNRAS, 295, 497

Geiger, B. & Schneider, P., 1999, MNRAS, 302, 118

Giovanelli, R. & Haynes, M. P., 1991, ARA&A, 29, 499

Goroff, M. H., Grinstein, B., Rey, S.-J., & Wise, M. B., 1986, ApJ, 311, 6 

Gould, A., 1995, ApJ, 440, 510 

Gradshteyn, I. S. & Ryzhik, I. M., 1994, Table of integrals, series and products. Academic 
Press, 5th edition 

Griffiths, R. E., Casertano, S., Im, M., & Ratnatunga, K. U., 1996, MNRAS, 282, 1159 
Gunn, J. E., 1967, ApJ, 150, 737 

Gunn, J. E. & Knapp, G. R., 1993, in ASP Conf. Ser. 43: Sky Surveys. Protostars to
Protogalaxies, p. 267

Gurvits, L. I. & Mitrofanov, I. G., 1986, Nat, 324, 349 

Gwyn, S. D. J. & Hartwick, F. D. A., 1996, ApJ, 468, L77 

Hamilton, A. J. S., Matthews, A., Kumar, P., & Lu, E., 1991, ApJ, 374, L1

Hamuy, M., Phillips, M. M., Suntzeff, N. B., Schommer, R. A., Maza, J., & Aviles, R., 
1996, AJ, 112, 2391 






Harrison, E. R., 1970, PhRvD, 1, 2726 

Hartwick, F. D. A. & Schade, D., 1990, ARA&A, 28, 437 

Hewitt, J. N., Turner, E. L., Schneider, D. P., Burke, B. F., Langston, G. I., & Lawrence,
C. R., 1988, Nat, 333, 537 

Hoekstra, H., Franx, M., Kuijken, K., & Squires, G., 1998, ApJ, 504, 636 

Hogg, D. W., Cohen, J. G., Blandford, R., Gwyn, S. D. J., Hartwick, F. D. A., Mobasher,
B., Mazzei, P., Sawicki, M., Lin, H., Yee, H. K. C., Connolly, A. J., Brunner, R. J., Csabai,
I., Dickinson, M., Subbarao, M. U., Szalay, A. S., Fernandez-Soto, A., Lanzetta, K. M., &
Yahil, A., 1998, AJ, 115, 1418

Holz, D. E., 1998, ApJ, 506, L1

Hu, W., 1995, Ph.D. thesis, Univ. of California, Berkeley

Hu, W., 1999, ApJ, 522, L21 

Hu, W. & Tegmark, M., 1999, ApJ, 514, L65 

Huchra, J., Gorenstein, M., Kent, S., Shapiro, I., Smith, G., Horine, E., & Perley, R., 1985,
AJ, 90, 691 

Hudson, M. J., Gwyn, S. D. J., Dahle, H., & Kaiser, N., 1998, ApJ, 503, 531 
Hui, L., 1999, ApJ, 519, L9 

Jacobs, M. W., Linder, E. V., & Wagoner, R. V., 1993, Phys. Rev. D, 48, 4623 

Jain, B., Mo, H. J., & White, S. D. M., 1995, MNRAS, 276, L25 

Jain, B. & Seljak, U., 1997, ApJ, 484, 560 

Jain, B., Seljak, U., & White, S., 1999, astro-ph/9901191 

Jain, B. & van Waerbeke, L., 1999, preprint, astro-ph/9910459

Jaroszynski, M., 1991, MNRAS, 249, 430 

Jaroszynski, M., Park, C., Paczynski, B., & Gott, J. R., 1990, ApJ, 365, 22

Jarvis, J. F. & Tyson, J. A., 1981, AJ, 86, 476 

Kaiser, N., 1984, ApJ, 284, L9 

Kaiser, N., 1992, ApJ, 388, 272 

Kaiser, N., 1995, ApJ, 439, L1

Kaiser, N., 1998, ApJ, 498, 26 

Kaiser, N., 1999, ApJ, submitted, preprint astro-ph/9904003 
Kaiser, N. & Squires, G., 1993, ApJ, 404, 441 






Kaiser, N., Squires, G., & Broadhurst, T., 1995, ApJ, 449, 460 

Kaiser, N., Squires, G., Fahlman, G., & Woods, D., 1994, in Clusters of galaxies, 
proc. XIVth Moriond astrophysics meeting, Meribel, France, 1994, p. 269

Kaiser, N. & Stebbins, A., 1984, Nat, 310, 391 

Kaiser, N., Tonry, J. L., & Luppino, G. A., 1999, PASP, submitted, preprint
astro-ph/9912181

Kaiser, N., Wilson, G., Luppino, G., Kofman, L., Gioia, I., Metzger, M., & Dahle, H., 1998, 
preprint astro-ph/9809268 

Kaldeich, B. (ed.), 1999, The Next Generation Space Telescope: Science Drivers and 
Technological Challenges, 34th Liege Astrophysics Colloquium, ESA 

Kashlinsky, A., 1988, ApJ, 331, L1

Kassiola, A., Kovner, I., & Fort, B., 1992, ApJ, 400, 41

Kauffmann, G., Nusser, A., & Steinmetz, M., 1997, MNRAS, 286, 795

Kneib, J.-P., Ellis, R. S., Smail, I., Couch, W. J., & Sharples, R. M., 1996, ApJ, 471, 643

Kneib, J. P. et al., 1995, A&A, 303, 27

Kneib, J.-P., Mathez, G., Fort, B., Mellier, Y., Soucail, G., & Longaretti, P.-Y., 1994, A&A,
286, 701

Kneib, J.-P., Mellier, Y., Fort, B., & Mathez, G., 1993, A&A, 273, 367

Kochanek, C. S., 1990, MNRAS, 247, 135 

Kochanek, C. S., 1993, ApJ, 419, 12 

Kochanek, C. S., 1996, ApJ, 466, 638 

Koo, D. C. & Kron, R. G., 1992, ARA&A, 30, 613 

Kovner, I. & Milgrom, M., 1987, ApJ, 321, L113

Kristian, J. & Sachs, R. K., 1966, ApJ, 143, 379 

Krolik, J. H., 1999, Active galactic nuclei: from the central black hole to the galactic 
environment, Princeton University Press, Princeton, NJ 

Kruse, G. & Schneider, P., 1999a, preprint, astro-ph/9904192

Kruse, G. & Schneider, P., 1999b, MNRAS, 302, 821

Kühr, H., Witzel, A., Pauliny-Toth, I. I. K., & Nauber, U., 1981, A&AS, 45, 367

Kuijken, K., 1999, A&A, submitted, preprint astro-ph/9904418 

Kundic, T. et al., 1997, ApJ, 482, 75 

Lacey, C. & Cole, S., 1993, MNRAS, 262, 627 






Lacey, C. & Cole, S., 1994, MNRAS, 271, 676 
Lifshitz, E. M., 1946, J. Phys. USSR, 10, 116 

Lilly, S. J., 1993, ApJ, 411, 501

Lilly, S. J., Cowie, L. L., & Gardner, J. P., 1991, ApJ, 369, 79 

Lilly, S. J., Le Fevre, O., Crampton, D., Hammer, F., & Tresse, L., 1995, ApJ, 455, 50

Limber, D. N., 1953, ApJ, 117, 134 

Linder, E. V., 1988, A&A, 206, 199 

Linder, E. V., 1990a, MNRAS, 243, 353 

Linder, E. V., 1990b, MNRAS, 243, 362 

Lombardi, M. & Bertin, G., 1998a, A&A, 330, 791

Lombardi, M. & Bertin, G., 1998b, A&A, 335, 1

Lombardi, M. & Bertin, G., 1999, A&A, 342, 337

Loveday, J. & Pier, J., 1998, in S. Colombi, Y. Mellier, and B. Raban (eds.), Wide-Field
Surveys in Cosmology, p. 317, Editions Frontieres

Lucy, L. B., 1994, A&A, 289, 983 

Luppino, G. A. & Kaiser, N., 1997, ApJ, 475, 20 

Lynds, R. & Petrosian, V., 1986, BAAS, 18, 1014

Lynds, R. & Petrosian, V., 1989, ApJ, 336, 1

Mao, S., 2000, in T. G. Brainerd and C. S. Kochanek (eds.), Gravitational Lensing: Recent
Progress and Future Goals

Mao, S. & Kochanek, C. S., 1994, MNRAS, 268, 569 

Maoz, D. & Rix, H.-W., 1993, ApJ, 416, 425 

Martinez-Gonzalez, E., Sanz, J. L., & Silk, J., 1990, ApJ, 355, L5

Marzke, R. O., Geller, M. J., Huchra, J. P., & Corwin Jr., H. G., 1994a, AJ, 108, 437

Marzke, R. O., Huchra, J. P., & Geller, M. J., 1994b, ApJ, 428, 43

Mellier, Y., 1998, preprint astro-ph/9812172

Mellier, Y., Dantel-Fort, M., Fort, B., & Bonnet, H., 1994, A&A, 289, L15
Mellier, Y., Fort, B., Soucail, G., Mathez, G., & Cailloux, M., 1991, ApJ, 380, 334
Metcalf, R., 1999, MNRAS, 305, 746
Metcalf, R. B. & Silk, J., 1997, ApJ, 489, 1






Metcalf, R. B. & Silk, J., 1998, ApJ, 492, L1
Metcalf, R. B. & Silk, J., 1999, ApJ, 519, L1

Miralda-Escude, J., 1991a, ApJ, 380, 1 
Miralda-Escude, J., 1991b, ApJ, 370, 1 

Misner, C. W., Thorne, K. S., & Wheeler, J. A., 1973, Gravitation, Freeman, New York

Mould, J., Blandford, R., Villumsen, J., Brainerd, T., Smail, I., Small, T., & Kells, W., 1994,
MNRAS, 271, 31

Naim, A., Lahav, O., Buta, R. J., Corwin Jr., H. G., De Vaucouleurs, G., Dressler, A.,
Huchra, J. P., Van den Bergh, S., Raychaudhury, S., Sodre Jr., L., & Storrie-Lombardi,
M. C., 1995a, MNRAS, 274, 1107

Naim, A., Lahav, O., Sodre Jr., L., & Storrie-Lombardi, M. C., 1995b, MNRAS, 275, 567
Narayan, R. & Bartelmann, M., 1997, preprint 
Narayan, R. & Nityananda, R., 1986, ARA&A, 24, 127 

Narayan, R. & Wallington, S., 1993, in Gravitational lenses in the universe, 31st Liege 
International Astrophysical Colloquium, June 1993, p. 217 

Natarajan, P., Kneib, J.-P., Smail, I., & Ellis, R. S., 1998, ApJ, 499, 600

Natarajan, P. & Kneib, J.-P., 1997, MNRAS, 287, 833

Navarro, J., Frenk, C., & White, S., 1996, ApJ, 462, 563

Navarro, J., Frenk, C., & White, S., 1997, ApJ, 486, 493

Norman, D. J. & Impey, C. D., 1999, AJ, in press

Norman, D. J. & Williams, L. L. R., 1999, ApJ, submitted, preprint astro-ph/9908177
Nottale, L., 1984, MNRAS, 206, 713
Ohanian, H. C., 1983, ApJ, 271, 551
Paczynski, B., 1987, Nat, 325, 572 
Paczynski, B., 1996, ARA&A, 34, 419 

Padmanabhan, T., 1993, Structure formation in the Universe, CUP 
Peacock, J. A., 1997, MNRAS, 284, 885 

Peacock, J. A., 1999, Cosmological physics, Cambridge University Press 
Peacock, J. A. & Dodds, S. J., 1996, MNRAS, 280, L19 

Peebles, P. J. E., 1980, The large-scale structure of the Universe, Princeton University 
Press, Princeton, NJ 






Peebles, P. J. E., 1993, Principles of physical cosmology, Princeton University Press 
Peebles, P. J. E. & Yu, J. T., 1970, ApJ, 162, 815
Pei, Y. C., 1995, ApJ, 438, 623
Pello, R. et al., 1999, A&A, 346, 359

Pello, R., Le Borgne, J.-F., Soucail, G., Mellier, Y., & Sanahuja, B., 1991, ApJ, 366, 405
Penzias, A. A. & Wilson, R. W., 1965, ApJ, 142, 419 

Peterson, B. M., 1997, An introduction to active galactic nuclei, Cambridge University 
Press, Cambridge 

Phillips, M. M., 1993, ApJ, 413, L105 

Press, W. & Schechter, P., 1974, ApJ, 187, 425

Press, W. H., Flannery, B. P., Teukolsky, S. A., & Vetterling, W. T., 1986, Numerical
Recipes, Cambridge University Press, Cambridge

Press, W. H. & Gunn, J. E., 1973, ApJ, 185, 397 

Pyne, T. & Birkinshaw, M., 1996, ApJ, 458, 46 

Reblinsky, K. & Bartelmann, M., 1999, A&A, 345, 1 

Reblinsky, K., Kruse, G., Jain, B., & Schneider, P., 1999, A&A, submitted

Rees, M. J., 1984, ARA&A, 22, 471 

Rees, M. J. & Sciama, D. W., 1968, Nat, 217, 511 

Refregier, A., Brown, S. T., Kamionkowski, M., Helfand, D. J., Cress, C. M., Babul, A., 
Becker, R., & White, R. L., 1998, in Wide Field Surveys in Cosmology, 14th IAP meeting
held May 26-30, 1998, Paris. Publisher: Editions Frontieres, p. 209 

Refsdal, S., 1964, MNRAS, 128, 307 

Refsdal, S. & Surdej, J., 1994, Rep. Prog. Phys., 56, 117 

Rhodes, J., Refregier, A., & Groth, E., 1999, ApJ, submitted, preprint astro-ph/9905090 
Richstone, D., Loeb, A., & Turner, E. L., 1992, ApJ, 393, 477 

Riess, A. G., Filippenko, A. V., Challis, P., Clocchiatti, A., Diercks, A., Garnavich, P. M.,
Gilliland, R. L., Hogan, C. J., Jha, S., Kirshner, R. P., Leibundgut, B., Phillips, M. M., Reiss,
D., Schmidt, B. P., Schommer, R. A., Smith, R. C., Spyromilio, J., Stubbs, C., Suntzeff,
N. B., & Tonry, J., 1998, AJ, 116, 1009

Riess, A. G., Press, W. H., & Kirshner, R. P., 1995, ApJ, 438, L17
Riess, A. G., Press, W. H., & Kirshner, R. P., 1996, ApJ, 473, 88
Rix, H.-W., Schneider, D. P., & Bahcall, J. N., 1992, AJ, 104, 959






Robertson, H. P., 1935, ApJ, 82, 284 

Rodrigues-Williams, L. L. & Hawkins, M. R. S., 1995, in Dark Matter, AIP conference 
proceedings 336, College Park, MD, 1994, p. 331 

Rodrigues-Williams, L. L. & Hogan, C. J., 1994, AJ, 107, 451 

Rood, H. J., 1981, RPPh, 44, 1077 

Roulet, E. & Mollerach, S., 1997, Phys. Rep., 279, 67 

Sachs, R. K. & Wolfe, A. M., 1967, ApJ, 147, 73 

Sanz, J. L., Martinez-Gonzalez, E., & Benitez, N., 1997, MNRAS, 291, 418

Sarazin, C. L., 1986, RvMP, 58, 1 

Sasaki, M., 1989, MNRAS, 240, 415 

Schechter, P., 1976, ApJ, 203, 297

Schechter, P. L. et al., 1997, ApJ, 475, L85

Schindler, S. & Wambsganss, J., 1996, A&A, 313, 113 

Schindler, S. & Wambsganss, J., 1997, A&A, 322, 66 

Schneider, P., 1985, A&A, 143, 413

Schneider, P., 1993, A&A, 279, 1

Schneider, P., 1995, A&A, 302, 639

Schneider, P., 1996a, IAU Symposia, 168, 209

Schneider, P., 1996b, MNRAS, 283, 837

Schneider, P., 1998, ApJ, 498, 43

Schneider, P. & Bartelmann, M., 1997, MNRAS, 286, 696

Schneider, P., Ehlers, J., & Falco, E. E., 1992, Gravitational Lenses, Springer Verlag, 
Heidelberg 

Schneider, P., King, L., & Erben, T., 1999, A&A, submitted, preprint astro-ph/9907143

Schneider, P. & Kneib, J.-P., 1998, in The Next Generation Space Telescope: Science
Drivers and Technological Challenges, 34th Liege Astrophysics Colloquium, June 1998,
p. 89

Schneider, P. & Rix, H.-W., 1997, ApJ, 474, 25
Schneider, P. & Seitz, C., 1995, A&A, 294, 411

Schneider, P., van Waerbeke, L., Jain, B., & Kruse, G., 1998a, MNRAS, 296, 873

Schneider, P., van Waerbeke, L., Mellier, Y., Jain, B., Seitz, S., & Fort, B., 1998b, A&A,
333, 767 






Schneider, P. & Wagoner, R. V., 1987, ApJ, 314, 154 

Schramm, T. & Kayser, R., 1995, A&A, 299, 1 

Scoccimarro, R. & Frieman, J., 1999, ApJ, in press 

Seitz, C., Kneib, J.-P., Schneider, P., & Seitz, S., 1996, A&A, 314, 707

Seitz, C. & Schneider, P., 1995a, A&A, 297, 287

Seitz, C. & Schneider, P., 1997, A&A, 318, 687

Seitz, S., Collodel, L., Pirzkal, N., et al., 1998a, in Wide field surveys in cosmology, p. 203,
Editions Frontieres

Seitz, S., Saglia, R., Bender, R., Hopp, U., Belloni, P., & Ziegler, B., 1998b, MNRAS, 298,
325

Seitz, S. & Schneider, P., 1992, A&A, 265, 1

Seitz, S. & Schneider, P., 1995b, A&A, 302, 9

Seitz, S. & Schneider, P., 1996, A&A, 305, 383

Seitz, S. & Schneider, P., 1998, astro-ph/9802051

Seitz, S., Schneider, P., & Bartelmann, M., 1998c, A&A, 337, 325

Seitz, S., Schneider, P., & Ehlers, J., 1994, Class. Quantum Grav., 11, 2345

Seldner, M., Siebers, B., Groth, E. J., & Peebles, P. J. E., 1977, AJ, 82, 249

Seljak, U., 1994, ApJ, 436, 509 

Seljak, U., 1996, ApJ, 463, 1 

Seljak, U., 1998, ApJ, 506, 64 

Seljak, U. & Holz, D., 1999, A&A, in press, preprint astro 

Smail, I., Ellis, R. S., & Fitchett, M. J., 1994, MNRAS, 270, 245

Smail, I., Ellis, R. S., Fitchett, M. J., & Edge, A. C., 1995a, MNRAS, 273, 277

Smail, I., Hogg, D. W., Yan, L., & Cohen, J. G., 1995b, ApJ, 449, L105

Smoot, G. F., 1997, Lectures held at Strasbourg NATO school on the CMB and cosmology,
preprint astro-ph/9705135

Smoot, G. F., Bennett, C. L., Kogut, A., et al., 1992, ApJ, 396, L1

Soucail, G., Fort, B., Mellier, Y., & Picat, J. P., 1987a, A&A, 172, L14

Soucail, G., Mellier, Y., Fort, B., Hammer, F., & Mathez, G., 1987b, A&A, 184, L7

Soucail, G., Mellier, Y., Fort, B., Mathez, G., & Cailloux, M., 1988, A&A, 191, L19

Squires, G. et al., 1997, ApJ, 482, 648 






Squires, G. & Kaiser, N., 1996, ApJ, 473, 65 

Squires, G., Kaiser, N., Babul, A., Fahlman, G., Woods, D., Neumann, D., & Bohringer, 
H., 1996a, ApJ, 461, 572 

Squires, G., Kaiser, N., Fahlman, G., Babul, A., & Woods, D., 1996b, ApJ, 469, 73 

Stebbins, A., 1996, American Astronomical Society Meeting, 189, 8207 

Stebbins, A., McKay, T., & Frieman, J., 1996, in C. Kochanek and J. Hewitt (eds.), 
Astrophysical applications of gravitational lensing, p. 75, Kluwer Academic Publishers; 
Dordrecht 

Steidel, C. C., Adelberger, K. L., Dickinson, M., Giavalisco, M., Pettini, M., & Kellogg,
M., 1998, ApJ, 492, 428 

Stickel, M. & Kühr, H., 1993, A&AS, 100, 395

Stickel, M., Kühr, H., & Fried, J. W., 1993, A&AS, 97, 483

Stocke, J. T., Morris, S. L., Gioia, I. M., Maccacaro, T., Schild, R., Wolter, A., Fleming,
T. A., & Henry, J. P., 1991, ApJS, 76, 813 

Szalay, A., 1998, American Astronomical Society Meeting, 192, 6405 

Taylor, A. N., Dye, S., Broadhurst, T. J., Benitez, N., & van Kampen, E., 1998, ApJ, 501,
539 

Tomita, K., 1989, PhRvD, 40, 3821 

Trager, S. C., Faber, S. M., Dressler, A., & Oemler, A., 1997, ApJ, 485, 92

Tran, K.-V. H., Kelson, D. D., Van Dokkum, P., Franx, M., Illingworth, G. D., & Magee,
D., 1999, ApJ, in press, preprint astro-ph/9902349

Tully, R. B. & Fisher, J. R., 1977, A&A, 54, 661

Turner, E. L., Ostriker, J. P., & Gott, J. R., 1984, ApJ, 284, 1

Tyson, J. A., 1986, AJ, 92, 691 

Tyson, J. A., 1988, AJ, 96, 1 

Tyson, J. A. & Seitzer, P., 1988, ApJ, 335, 552

Tyson, J. A., Valdes, F., Jarvis, J. F., & Mills Jr., A. P., 1984, ApJ, 281, L59
Tyson, J. A., Valdes, F., & Wenk, R. A., 1990, ApJ, 349, L1
Valdes, F., Tyson, J. A., & Jarvis, J. F., 1983, ApJ, 271, 431

van Haarlem, M. P. & van der Hulst, J. M. (eds.), 1999, Scientific Imperatives at centimeter 
and meter Wavelengths 

van Kampen, E., 1998, MNRAS, 301, 389 

Van Waerbeke, L., 1998, A&A, 334, 1 






van Waerbeke, L., Bernardeau, F., & Mellier, Y., 1999, A&A, 342, 15

Van Waerbeke, L., Mellier, Y., Schneider, P., Fort, B., & Mathez, G., 1997, A&A, 317, 303 

Viana, P. T. P. & Liddle, A. R., 1996, MNRAS, 281, 323

Villumsen, J., Freudling, W., & DaCosta, L., 1997, ApJ, 481, 578 

Voges, W., 1992, in Environment observation and climate modelling through international 
space projects. Space sciences with particular emphasis on high-energy astrophysics, p. 9 

Walker, A. G., 1935, Quant. Journ. Math. Oxford Sci., 6, 81

Wallington, S., Kochanek, C. S., & Koo, D. C., 1995, ApJ, 441, 58

Walsh, D., Carswell, R. P., & Weymann, R. J., 1979, Nat, 279, 381

Wambsganss, J., Cen, R., & Ostriker, J. P., 1998, ApJ, 494, 29

Wambsganss, J., Cen, R., Xu, G., & Ostriker, J. P., 1997, ApJ, 475, L81

Warren, S. J. & Hewett, P. C., 1990, RPPh, 53, 1095

Watanabe, K. & Tomita, K., 1991, ApJ, 370, 481 

Waxman, E. & Miralda-Escude, J., 1995, ApJ, 451, 451 

Weedman, D. W., 1986, Quasar astronomy, Cambridge University Press, Cambridge 
Weinberg, S., 1972, Gravitation and cosmology, Wiley, New York 
Weinberg, S., 1976, ApJ, 208, L1

White, R. L., Becker, R. H., Helfand, D. J., & Gregg, M. D., 1997, ApJ, 475, 479
White, S. D. M., Davis, M., Efstathiou, G., & Frenk, C. S., 1987, Nat, 330, 451
White, S. D. M., Efstathiou, G., & Frenk, C. S., 1993, MNRAS, 262, 1023 
Williams, L. L. R. & Irwin, M., 1998, MNRAS, 298, 378 

Williams, R. E., Blacker, B., Dickinson, M., Dixon, W. V. D., Ferguson, H. C., Fruchter,
A. S., Giavalisco, M., Gilliland, R. L., Heyer, I., Katsanis, R., Levay, Z., Lucas, R. A.,
McElroy, D. B., Petro, L., Postman, M., Adorf, H.-M., & Hook, R., 1996, AJ, 112, 1335

Wilson, G., Cole, S., & Frenk, C. S., 1996, MNRAS, 280, 199 

Wu, X.-P., 1996, Fund. Cosm. Phys., 17, 1

Wu, X.-P. & Fang, L.-Z., 1996, ApJ, 461, L5

Wu, X.-P. & Han, J., 1995, MNRAS, 272, 705

Zaldarriaga, M. & Seljak, U., 1998a, PhRvD, 58, 023003

Zaldarriaga, M. & Seljak, U., 1998b, PhRvD, submitted, preprint astro-ph/9810257
Zaldarriaga, M., Spergel, D. N., & Seljak, U., 1997, ApJ, 488, 1 






Zaritsky, D. & White, S. D. M., 1994, ApJ, 435, 599 

Zaroubi, S., Squires, G., Hoffman, Y., & Silk, J., 1998, ApJ, 500, L87 

Zel'dovich, Ya. B., 1964, SvA, 8, 13
Zel'dovich, Ya. B., 1972, MNRAS, 160, 1P

Zensus, J. A. & Pearson, T. J., 1987, Superluminal radio sources, Cambridge University 
Press, Cambridge 

Zwicky, F., 1933, Helv. Phys. Acta, 6, 110

Zwicky, F., Herzog, E., & Wild, P., 1968, Catalogue of galaxies and of clusters of galaxies,
California Institute of Technology, Pasadena 






